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DNA encoding ovine adenovirus (OAV 287) and its use as a viral vector 

Technical Field 

The present invention relates to a new full length 
genomic clone derived from a benign adenovirus (OAV287) 
5 isolated from sheep in Australia, The present invention 
also relates to new viral vectors derived from the benign 
ovine adenovirus and also relates to the use of these 
vectors for the delivery and expression of nucleic acid 
sequences encoding functional RNA molecules or 
10 polypeptides to animals, 

Pacfrgrovind <?f the ipventipn 

Diseases caused by infectious agents and parasite 
infestations cause health problems and production losses 
in domestic animals but for many infectious agents no 

15 vaccine exists. Consequently, there are major research 
efforts worldwide to develop new vaccines which can 
protect against disease. 

While some protective antigens from infectious 
agents and parasites have been identified, their 

20 successful use as vaccines requires the development of 

systems which can effectively deliver the antigen to the 
host. A variety of recombinant gene expression vectors 
derived principally from the pox virus family have been 
employed as these are generally of low pathogenicity. 

25 Expression of the foreign protein following infection by 
the recombinant viral vector may stimulate a protective 
immune response in the host. 

However, no viral vector has all the attributes 
desirable for all situations. Some vectors are better 

30 suited to particular tasks than others because of their 
biological properties. For example, it has often proved 
difficult to stimulate an effective mucosal immune 
response which can protect against disease. In humans, 
adenoviruses have been given orally to vaccinate against 

35 respiratory disease (1). As this involves protection at 
mucosal surfaces adenoviruses clearly have potential in 
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this regard. Human adenovirus vectors have also been used 
to deliver genes to muscle (2) and other tissues. 
Although adenoviruses do not generally integrate their DNA 
into the cellular genome, nevertheless, the DNA persists 
5 and long term protein expression is observed. Expression 
of an appropriate antigen from such cells can generate a 
systemic immune response which may be protective against 
the homologous disease causing agent. 

Known adenovirus genomes are linear double-stranded 

10 DNA molecules which have an inverted terminal repeat 

sequence (ITR) at each end and a protein covalently bound 
to the 5 '-terminal C residue (3). The genome sequence and 
structure has now been completely determined for human 
adenoviruses types 2, 5, 12 and 40 and partially for 

15 numerous others, including some animal isolates (see 
Genebank and EMBL Nucleic Acid databases). Human 
adenovirus type 2 was the first genome to be sequenced but 
broadly speaking its genome arrangement is conserved among 
other characterized adenoviruses i.e. early regions E1-E4 

20 and the structural protein homologues can be recognized in 
similar locations in the genome. In particular, the 
E1A/E1B region is located at the left hand end of the 
genome and region E4 is always located at the right hand 
end of the genome. Early region E3 is always located 

25 between the genes for structural proteins pVIII and fiber, 
although its size and complexity varies between species 
e.g. from 3kb with at least 10 open reading frames in 
human adenoviruses to approximately 0.7kb with only two 
significant open reading frames in murine adenovirus (4, 

30 5). E3 is a key region for the construction of 

recombinant viruses as it is non-essential for replication 
in vitro (6). The late, L region is expressed from the 
major late promoter, MLP and complex splicing generates 
families of mRNAs which code for most of the structural 

35 viral proteins. Proteins IVa2 and IX appear to have their 
own promoters . 
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Although there are some human viral vectors 
available for medical use there are few animal viral 
vectors suitable for use in veterinary applications. In 
order to obtain a more suitable animal viral vector the 
5 present inventors have purified an ovine adenovirus 

(OAV2 87) isolated from sheep in Western Australia. This 
ovine adenovirus is serologically related to bovine 
adenovirus type 7 but is genetically distinct from the 
bovine adenoviruses and other Australian ovine isolates, 

10 as shown by comparisons between the ovine and bovine 

adenoviruses, based on restriction enzyme profiles (8). 
The genome arrangement of the virus according to the 
present invention varies significantly from all other 
known adenoviruses. The adenoviral DNA molecule of the 

15 present invention is suitable for use in viral vectors 

capable of expressing a variety of polypeptides when used 
for veterinary applications. 
Summary of the Invention 

According to a first aspect, the present invention 

20 consists in an isolated DNA molecule comprising a nucleic 
acid sequence encoding the genome of ovine adenovirus 
(OAV287) substantially as shown in Figure 1 or a 
functionally equivalent nucleic acid sequence. 
Preferably, the nucleic acid sequence encoding the genome 

25 of the adenovirus is substantially as shown in Figure 1. 

In a further preferred embodiment of the first 
aspect of the present invention, the DNA molecule 
comprises a nucleic acid sequence encoding the genome of 
ovine adenovirus (OAV287) wherein a portion of the 

30 adenoviral genome not essential for the maintenance or 
viability of the native adenovirus deleted or altered. 

In a second aspect, the present invention consists 
in a DNA molecule including at least a fifteen nucleic 
acid base sequence being substantially unique to the ovine 

35 adenovirus (OAV2 87) nucleic acid sequence shown in Figure 
1. In a preferred embodiment of the second aspect of the 
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present invention, the at least fifteen nucleic acid base 
sequence encodes a functional element of ovine adenovirus 
(OAV287). Preferably, the functional element is selected 
from the group consisting of promoter, gene, inverted 
5 terminal repeat, viral packaging signal and RNA processing 
signal. The inverted terminal repeat of ovine adenovirus 
(OAV2 87) comprises the first 4 6 nucleic acid bases from 
the 5' end of each strand of the double stranded DNA 
genome of the virus. 

10 In a third aspect, the present invention consists in 

a plasmid including the DNA molecule of the first or 
second aspects of the present invention. Preferably, the 
plasmid includes the DNA molecule of the first aspect of 
the present invention wherein the nucleic acid sequence 

15 encoding the adenoviral genome is linked to a nucleic acid 
sequence encoding an origin of replication and a further 
nucleic acid encoding a marker. Preferably, the nucleic 
acid sequence encoding the marker encodes for resistance 
to an antimicrobial agent. More preferably the 

2 0 antimicrobial agent is ampicillin. 

In a further preferred embodiment of the third 
aspect of the present invention, sequences encoding 
inverted terminal repeats of the adenovirus are joined. 

In a fourth aspect, the present invention consists 

25 in a viral vector comprising the DNA molecule of the first 
aspect of the present invention and at least one nucleic 
acid sequence encoding a non-adenoviral polypeptide or 
polypeptides . 

Preferably, nucleic acid sequence encoding the non- 
30 adenoviral polypeptide or polypeptides is derived from 
bacteria, viruses, parasites or eukaryotes . More 
preferably, the non-adenoviral polypeptide is rotavirus 
VP7sc antigen, the parasite polypeptide is 
Trichostrongylus colubriformis 17kD antigen, the 
35 Taenia ovis 4 5W antigen or the PM95 antigen from Lucilla 
cuprina . 
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In another form, the present invention consists in a 
viral vector comprising the DNA molecule of the first 
aspect of the present invention and at least one nucleic 
acid sequence encoding a functional RNA molecule. It will 
5 be appreciated by one skilled in the art that a functional 
RNA molecule can include a messenger RNA molecule, an 
antisense RNA molecule or a ribozyme. 

In a fifth aspect, the present invention consists in 
a method of delivering a DNA molecule having a nucleic 

10 acid sequence encoding a non-adenoviral polypeptide or 
polypeptides to a target cell comprising infecting the 
target cell with a viral vector according to the fourth 
aspect of the present invention such that the DNA molecule 
encoding the polypeptide or polypeptides is expressed and 

15 the polypeptide or polypeptides is produced by the target 
cell . 

In a sixth aspect, the present invention consists in 
a method for delivering a DNA molecule having a nucleic 
acid sequence encoding a non-adenoviral polypeptide or 

20 polypeptides to an animal comprising administering to the 
animal a viral vector according to the fourth aspect of 
the present invention such that the viral vector infects 
at least one cell of the animal and the infected cell 
expresses the DNA molecule encoding the polypeptide or 

25 polypeptides and produces the polypeptide or polypeptides. 
Preferably the animal is a grazing animal and more 
preferably the grazing animal is a sheep. 

In another form, the present invention consists in a 
method for delivering a DNA molecule having a nucleic acid 

30 sequence encoding a functional RNA molecule to an animal 
comprising administering to the animal a viral vector of 
the fourth aspect of the present invention having a 
nucleic acid sequence encoding a functional RNA molecule 
such that the viral vector infects at least one cell of 

35 the animal and the infected cell expresses the DNA 
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molecule encoding the functional RNA molecule and produces 

the functional RNA molecule. 

As used herein the term "functionally equivalent 

nucleic acid sequence" is intended to cover minor 
5 variations in the ovine adenovirus (OAV287) DNA molecule 

which, due to degeneracy in the DNA code, does not result 

in the molecule encoding different viral polypeptides. 

Further, this term is intended to cover alterations in the 

DNA code which lead to changes in the encoded 
10 polypeptides, but in which such changes do not 

substantially affect the biological activities of these 

viral polypeptides . 

As used herein the term "functional element" is 

intended to cover nucleic acid sequences that encode 
15 promoters, genes, inverted terminal repeats, viral 

packaging signals and RNA processing signals. It will be 

appreciated by one skilled in the art that unique 

sequences from ovine adenovirus (OAV287) that encode these 

functional elements may be useful in other systems 
2 0 including plasmids and non-ovine adenoviral vectors. 

In order that the nature of the present invention 

may be more clearly understood preferred forms thereof 

will be described with reference to the following examples 

and the accompanying drawings . 
25 Brief Des cription of the Drawings 

Figure 1 is the nucleic acid sequence of the OAV2 87 

genome beginning at base 1 of the left-hand ITR. 

Figure 2 shows the arrangement of OAV28 7 genes based 

on homologies detected with Ad2 . Regions with question 
30 marks are tentative identifications because of the lack of 

obvious homology. 

Figure 3 indicates the major open reading frames in 

the proposed El region of OAV287. Asterisks show the 

location of possible initiation codons . A previously 
35 unidentified gene (p28kD) which codes for a processed 

structural protein is encoded on the complementary strand. 
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Figure 4 shows open reading frames in the region of 
the OAV287 expected to contain E3. However, E3 is missing 
as the gap between the pVIII and fiber genes is only 197 
nucleotides* The site at which the Apal/NotI polylinker 
5 was later inserted is indicated. 

Figure 5 shows the major open reading frames in the 
probable E3 region of OAV287 . Asterisks show the location 
of potential initiation codons . The Sail site which was 
modified by end-filling and re-ligation and the 
10 alternative site at which a polylinker sequence was later 
inserted into the genome without loss of infectivity is 
indicated. 

Figure 6 is a scheme describing the construction of 
a plasmid (pOAV2 87Cm) containing a full-length clone of 

15 the OAV287 genome with pACYC184 sequences inserted in the 
Sail site. Filled in regions show OAV2 87 sequences. 
Cross-hatched sequences are derived from plasmids pUC13 or 
Bluescribe M13+ (Amp R ) , stippled regions from pSELECT 
(Tet R ) and open regions from pACYC184 (Cm R ) . Only the key 

20 restriction sites used for plasmid construction are 
indicated. 

Figure 7 shows a map of the plasmids pOAVlOO, 
pOAV200, pOAV600 and pOAVGOOS. Arrowheads indicate the 
ITRs and the approximate location of the major late 

25 promoter (MLP) . The mutated Sail site and sites at which 
the Apal/NotI polylinker sequences were inserted are 
indicated. Light hatching signifies modified Bluescribe 
sequences inserted in the Kpnl site. Linear, infectious 
genomes (dark hatching) are released by digestion with 

30 Kpnl. 

Figure 8 shows the results of screening ovine 
adenoviruses OAV100 and OAV200 rescued by transfection of 
recombinant plasmids pOAVlOO and pOAV200 into CSL503 
cells. Portions of the genome spanning (A) the mutated 
35 SphI site in OAV100 and (B) the Apal/EcoRV/NotI polylinker 
insertion site in OAV200 were amplified by PCR together 
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with the corresponding regions from wild-type OAV287. The 
products were digested with SphI (A, lanes 3 & 5) and 
Apal, EcoRV or NotI (B, lanes 3-5, and 8-10, 
respectively). (U) indicates undigested samples. 
5 Figure 9 is a map of a plasmid pMT used for the 

assembly of gene expression cassettes. Fragments 
containing the OAV287 major late promoter and tripartite 
leader sequences are linked and precede a multiple cloning 
site for the insertion of genes of interest. A tandem 

10 polyadenylation signal (AATAAA) follows. 

Figure 10 shows a summary of recombinant viruses 
which have been rescued from the corresponding infectious 
plasmids and the gene expression cassettes they carry. 
Cassettes were inserted into the OAV genome between the 

15 pVIII and fibre genes as indicated. 

Figure 11 shows the expression of (A) the T. ovis 
45W and L. cuprina PM95 antigens in CSL503 cells following 
infection of these cells with OAV205 and OAV210 viruses, 
respectively and (B) VP7sc expression in CSL503 and bovine 

20 nasal turbinate cells following infection with virus 

OAV204. (I) Infected cells (U) Uninfected cells. (M) 
indicates marker proteins of the sizes shown. 

Figure 12 shows expression of VP7sc in (A) CSL503 
cells and (B) rabbit kidney and bovine nasal turbinate 

25 cells following infection with OAV206 virus. (I) Infected 
cells. (U) uninfected cells. (M) indicates marker proteins 
of the sizes shown. 
■ Description of the Invention 
METHODS 

30 Growth and Purification of OAV28 7 

The virus, isolated from sheep in 1985 , was obtained 
from R.L. Peet, Animal Health Laboratory, Department of 
Agriculture, Western Australia. The virus isolate was 
grown in sheep foetal lung cells (line CSL503) and twice 

35 plaque-purified under solid overlay before stocks were 
prepared. Virus was purified from CSL503 cells as 
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described previously (18, 22). DNA was extracted from the 
virus by digestion with proteinase K (23). 
Cloning of Genome Fragments 

Molecular techniques for manipulation, modification 
5 and transformation of plasmid DNA which were used in the 
work described below are described in (9) and similar 
publications. OAV287 DNA was digested with various 
restriction endonucleases including BamHI, SphI, Smal and 
Sail to deduce the location of these sites (18). 

10 The adenovirus genome has a protein covalently 

linked to each end of the linear dsDNA (24). The BamHI A 
and D fragments of approximately 8kb and 4kb, 
respectively, were identified as the terminal genomic 
fragments because their migration into agarose gels was 

15 dependent on the pre-digestion of viral DNA with 

proteinase K. The internal BamHI fragments B, C, E and F, 
estimated at 6.2, 5.1, 3.4 and 1 . lkb in size respectively, 
were separated on an agarose gel, recovered and cloned 
into BamHI-digested pUC13 using standard ligation and 

20 transformation procedures (9). To clone the terminal 
BamHI A and D fragments, viral DNA (10|ig) was digested 
with proteinase K (50jag/ml in lOmM Tris/HCL, pH8.0, 
containing ImM EDTA and 0.5% SDS) at 65°C for 60min to 
remove the terminal protein. The DNA was extracted twice 

2 5 with phenol /chloroform, once with ether and recovered by 
ethanol precipitation. The 3 'ends (of unknown sequence) 
were then digested exo-nucleolytically with T4 DNA 
polymerase (5 units, Toyobo, Tokyo, Japan) in the presence 
of dATP (100|iM) in buffer containing Tris HCL (50mM), 

30 pH8.0, MgCl2 (7mM), 2-mercaptoethanol (7mM) and BSA 

(lOfig/ml) for 15min at 37°C. The DNA was again purified 
by phenol extraction and ethanol precipitation described 
above. To remove the single-stranded terminal regions and 
create blunt ends the DNA was digested with 1 unit of mung 

35 bean nuclease (Pharmacia, North Ryde, Australia) for 10 

min at 37°C in buffer containing Na acetate (30mM), pH4 . 6 , 
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NaCl (50mM) and ZnCl2 (ImM) before extraction with 
phenol/chloroform and recovery by ethanol precipitation. 
Finally the DNA was digested with BamHI (Pharmacia) and 
the fragments were separated by electrophoresis in low- 
5 melting-point agarose. The BamHI A and D fragments were 
excised, recovered by NACS column chromatography (BRL, 
Gaithersburg, Md) and ligated with BamHI/HincII-cut 
plasmid Bluescribe M13 + (Stratagene, La Jolla, Ca) prior 
to transformation into E. coli JM109. Positive clones 

10 carrying fragments of the expected size were identified, 
restriction digested and confirmed as correct by 
nucleotide sequencing and comparison with partial sequence 
determined directly from genomic DNA. This revealed that 
three 3 ' -terminal nucleotides were removed during the 

15 cloning procedure. 

Nucleotide Sequencing of the OAV2 87 Genome 

The complete sequence of the OAV287 genome was 
determined by sequencing the BamHI fragments A-F using the 
Sanger method (25) and various kits provided by commercial 

20 suppliers. Nested deletions were constructed for the five 
largest fragments using a double- stranded nested deletion 
kit (Pharmacia). These were sequenced using standard 
primers . Based on newly determined sequence other 
nucleotide primers were synthesised using a DNA 

25 synthesizer (AB1, Model 391). In this way both strands of 
the entire genome and the junctions between the fragments 
were sequenced. 

Mutagenesis of the OAV287 genome 

For the construction of a full length OAV287 clone 

30 and subsequent modification of it to create plasmids such 
as pOAV200 and pOAV600 certain mutations were required. A 
relevant portion of the genome was subcloned into 
Bluescribe (Stratagene, La Jolla, Ca) or a similar plasmid 
which allowed rescue of single stranded DNA. Later it 

35 became possible to use dsDNA for mutagenesis. 

Oligonucleotides of the desired sequence were synthesized, 
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phosphorylated and used as primers as described by the 
manufacturers of Muta-gene Phagemid (Biorad Labs, Ca) or 
Altered sites II (Promega, Wi) mutagenesis kits. 
Mutations were generally identified by digestion with the 
5 appropriate restriction enzyme or by nucleotide 

sequencing, or both. Genome fragments containing 
introduced mutations were subcloned to create larger 
plasmids such as pOAV200 using appropriate unique 
restriction sites . 

10 Construction of a Full-Length Genomic Clone of OAV287 
The terminal BamHI A and D fragments (cloned in 
Bluescribe MIS 4 ') were each modified by mutagenesis to add 
the nucleotides lost during cloning and a Kpnl site. The 
last base of the Kpnl site incorporated the C at the 5' 

15 end of each genomic ITR sequence . This produced plasmids 
pAK and pDK (Figure 6). 

The left hand approximately 21.5kb of the genome was 
constructed from the BamHI D and B fragments and the SphI 
A fragment of approximately 13kb. The genomic BamHI B 

20 fragment cloned in pUC13 was modified by mutagenesis 

(GCATGC to GCATCC) to remove the SphI site at position 
8287 producing pUC13B. The modified fragment was released 
by BamHI digestion and cloned into pDK which had been cut 
with BamHI and dephosphorylated . Colonies carrying the 

25 recombinant plasmid pDBM (Figure 6) were identified by 

screening with an oligonucleotide which spanned the BamHI 
B/D junction. The SphI A fragment (approximately 13kb) 
was cloned into the SphI site of pSELECT (Promega) to form 
pSESPH. This fragment contains a Smal site near its left 

30 hand end which is common to pDBM. The Kpnl /Smal fragment 
from pDBM was subcloned into pSESPH which had also been 
cut with Kpnl /Smal to produce pSELLH, a plasmid based on 
pSELECT which now contained the left-hand approximately 
21.5kb of OAV287 DNA. 

35 The right-hand end of the genome was constructed 

from pAK which contains the right-hand approximately 8.6kb 
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of the genome and overlaps the SphI A fragment- pAK was 
cut with Sail and ligated with Sail-cut pACYC184, a 
plasmid of 4.24kb which contains a gene encoding 
chloramphenicol (Cm) resistance and an origin for DNA 
5 replication, to form a pACm (Figure 6). This plasmid was 
cut with SphI and Kpnl to produce the right-hand genomic 
fragment incorporating the pACYC184 sequences. This was 
ligated with the left-hand KpnI/SphI fragment of 
approximately 21.5kb prepared from pSELLH to produce the 
10 final plasmid pOAV287Cm (Figure 6). This plasmid 

replicates stably in E. coli and therefore removes the 
need to propagate the virus to obtain genomic DNA for 
further study. The recombinant genome in plasmid 
pOAV2 87Cm differs from the wild- type viral genome by the 
15 single point mutation in the SphI site (base 8287), by the 
presence of pACYC184 sequences in the Sail site and by the 
addition of a GTAC sequence between the ITRs . However, 
insertion of pACYC184 sequences in the Sail site disrupts 
two significant open reading frames whose functions are 
20 unknown. If either of the gene products was essential for 
replication, then pOAV287Cm could not produce infectious 
virus following transf ection . To circumvent this 
potential problem pOAV2 87Cm was modified further. First, 
plasmid Bluescribe M13- (Stratagene, La Jolla, Ca.) was 
25 cut with Hindlll and end-filled. The linear plasmid was 
then cut with Smal , blunt-end ligated and transformed. 
The resulting plasmid contained an ampicillin resistance 
gene and origin of replication and lacked Sail and SphI 
sites but retained a unique Kpnl site. This plasmid was 
30 cut with Kpnl and ligated with KpnI-cut pOAV287Cm. 

Plasmids which were doubly resistant to ampicillin and 
chloramphenicol were selected and grown. One of these was 
cut with Sail to release the pACYC184 sequences, religated 
and transformed. The resulting plasmid pOAVlOO contained 
35 the AmpR gene and replication Ori inserted in the Kpnl 
site between the ITR's of the genome (Figure 7). This 
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plasmid replicated stably in E. coli strain JM109 when 
maintained in the presence ampicillin (200fag/ml). Large 
quantities of plasmid were grown for trans feet ion studies. 
Trans fee tion of DNA and Virus rescue 
5 To determine whether the recombinant genomic clone 

was infectious, pOAVlOO was cut with Kpnl to release the 
linear viral genome and DNA was trans fected into CSL503 
sheep foetal lung cells using lipof ectamine (GibcoBRL) . 
Solution (A) containing plasmid DNA (2-10jig) and 300|il 

10 EMEM (containing hepes + glutamine) , but lacking foetal 

calf serum (FCS) and solution (B) containing lipof ectamine 
(lOp.1) + 300|il EMEM (containing hepes + glutamine), but 
lacking FCS were combined, mixed gently and incubated for 
45 minutes at room temperature- Subconfluent CSL503 cells 

15 in a 60mm petri dish were rinsed with 3ml EMEM (plus hepes 
and glutamine) lacking FCS. EMEM (plus hepes and 
glutamine) but lacking FCS (2.4ml) was added to the 
mixture of solutions A and B, mixed gently and added to 
the rinsed CSL503 cells. Cells were incubated for 5 hours 

20 at 37°C in 5% CO2 • The incubation medium was changed 
using complete EMEM plus FCS (10%) and cells were 
incubated at 37°C in 5% CO2 until virus plaques or 
cytopathic effect was visible (7-15 days). 

To confirm that viruses rescued from trans fection of 

25 pOAVlOO and pOAV200 were derived from those plasmids a 
portion of the genome of wild-type OAV287, OAV100 and 
OAV200 viruses was amplified by PCR. For OAV100 a primer 
pair spanning the region of the mutated SphI site at bases 
8287-8292 was used. For OAV200 the primer pair spanned 

30 the insertion site for the Apal/NotI polylinker between 
the pVIII and fiber genes. Wild-type OAV287 DNA was 
amplified as a control in each case. DNA amplified from 
wild-type OAV2 87 was cut with SphI whereas the DNA 
amplified from OAV100 was not (Figure 8A) . Similarly 

35 OAV200 DNA was cut with Apal, EcoRV and NotI whereas 
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OAV287 DNA was not (Figure 8B) . Other viruses were 
similarly characterised by restriction enzyme digestion. 
Identification of MLP/TLS elements and Construction of pMT 
OAV2 87 TLS elements were identified as follows and 
5 as described (17)- mRNAs present in OAV2 87- infected 
CSL503 cells were copied into cDNA by reverse 
transcription using primers complementary to the Ilia or 
fiber genes. A primer thought to fall within TLS exon 1 
was then paired with each cDNA primer for PCR. DNA was 

10 successfully amplified, cloned and sequenced. This 

identified TLS exons 2 and 3 (which correspond to bases 
8083-8145 and 8350-8412 of Figure 1, respectively) and the 
3' boundary of TLS exon 1 which occurs at base 5044 of 
Figure 1 . A second PCR strategy was then used to obtain 

15 MLP and TLS fragments suitable for assembly into pMT . The 
region in Figure 1 between nucleotides 4861 and 5023, 
thought to contain the MLP was amplified by PCR using a 
plus sense primer which added an Apal sequence at the 5' 
end and a 3 ' minus sense primer which introduced an Ndel 

20 site by point mutation at base 5012. Similarly, the TLS 
was amplified using a plus sense primer which introduced 
the Ndel site at base 5012 and a minus sense primer which 
was complementary to bases 8396-8412 and which added a 
Hindlll site at the 3' end of the PCR product. The PCR 

25 fragments were digested with Apal/Ndel and Ndel/Hindlll , 

respectively and the fragments were cloned into Bluescript 
SK+ (Stratagene) cut with Apal/Hindlll . The resulting 
plasmid was then digested with Hindlll/NotI and a 
synthetic oligonucleotide with Hindi II/NotI termini and 

30 the sequence shown in Figure 9 was cloned to produce 
plasmid pMT. Genes of interest were then cloned into 
convenient restriction sites in the NCS . Gene expression 
cassettes were subcloned as Apal/NotI fragments into 
pOAV2 00 or rescued into infectious virus. 
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Infection of cells and expression of antigens 

CSL503 and other cells were infected with viruses at 
a multiplicity of infection of 20pfu/cell as described 
previously (21) . Infection was allowed to proceed for 24- 
5 60 hr. Cells were then incubated in methionine-f ree 

medium in the presence of 3^s-methionine to label newly 
synthesized proteins. The protein of interest was 
recovered from cell lysates by immunoprecipitation using a 
specific antiserum against the expressed protein (21). 
10 Recovered proteins were analysed by polyacrylamide gel 

electrophoresis and detected by autoradiography or using a 
phophorimager (Molecular Dynamics). 

To characterise the genome in molecular terms , BamHI 
15 restriction fragments representing the entire OAV2 87 

genome were cloned into various plasmids and sequenced 
using methods described in Sambrook (9) and similar 
publications. Sequences were determined on both strands 
by using nested sets of deletion mutants together with 
2 0 synthetic oligonucleotide primers which were synthesized 
from newly determined sequences . 

The viral sequence of 29 , 544 nucleotides (Figure 1) 
is considerably shorter (by approximately 6.5kb) than the 
sequence for human adenoviruses but many genes encoding 

2 5 structural proteins are identified by their homology with 

their Ad2 horaologues (Figure 2). It is clear, however, 
that the ovine adenovirus genome shows major structural 
and sequence variations compared with all other 
adenoviruses studied to date (Figure 2), in the regions 

3 0 encoding both structural and non-structural proteins. In 

particular, 

(a) the reading frames tentatively identified as 
forming the E1A/B regions are named principally on the 
basis of their location in the genome. Very limited 
35 homology can be detected between the 44.5kD open reading 
frame (orf) and the large T E1B protein of other 
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adenoviruses. Homology in the putative E1A region of 
OAV2 87 has not so far been detected; 

(b) in other adenoviruses the E4 region is normally 
located at the right-hand end of the genome- The OAV287 
5 E4? region is tentatively identified based only on the 

presence of a protein sequence motif HCHC . . PGSLQC which is 
found in 18.8kD and 30.85kD orfs in this region. 
Identical or very similar motifs are found in the E4 34kD 
protein of human Ad2 and Ad40 and mouse adenoviruses; 
10 (c) the distance between the end of pVIII and the 

beginning of fiber, which in other viruses defines the E3 
region, is only 197 nucleotides (Figure 4). The E3 region 
equivalent, if it exists in ovine adenovirus, may consist 
of the cluster of open reading frames which are present in 
15 the right to left orientation on the complementary DNA 

strand, at the right-hand end of the genome (Figures 2 and 
5) . However, these sequences show no detectable homology 
with any other adenovirus and the functions of these 
proteins cannot be deduced from such comparisons; 
20 (d) there is a region of approximately lkb which 

lies between E3? and E4? which has a very high A/T content 
(70.2%) (Figure 1). As there are no open reading frames 
encoding greater than approximately 30 amino acids in 
length on either DNA strand it is unlikely that the region 
25 codes for any proteins, unless mRNAs are generated by very 
complex splicing events. This region has no known 
equivalent in any other adenovirus; 

(e) other differences are apparent in the structural 
proteins of the virus. OAV287 lacks homologues of Ad2 
30 proteins V and IX. However, OAV287 has a completely new 
gene coding for p2 8kD which is located on the 
complementary strand of the E1A? region (Figure 2 and 3). 
This is a structural protein with an apparent size of 2 8kD 
by SDS PAGE which, according to N-terminal sequencing 
35 data, is cleaved from a larger precursor. No homology 
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between this protein and others in the databases has been 
detected; 

(f) in most other genomes the VA RNA genes are 
located between the Terminal protein and the 52 /55k genes, 
5 In OAV287 there is no room for them as the reading frames 
overlap. 

These differences serve to emphasize the unique 
character of the OAV2 87 isolate compared with other human 
and animal adenoviruses. In addition, since the OAV287 

10 non-structural regions show little or no homology with 
equivalent regions in other adenoviruses, sequence 
comparisons do not reveal the identity of likely non- 
essential regions of the genome. Moreover the viral DNA 
cannot easily be manipulated to test for dispensable 

15 sequences. 

The present inventors have produced a plasmid 
containing a full length infectious copy of an ovine 
adenovirus genome in which the ITR sequences are linked by 
a short sequence which creates a unique restriction enzyme 

20 site. A plasmid containing a full length infectious copy 
of an ovine adenovirus genome linked to a bacterial origin 
for DNA replication and a marker gene has been produced. 
Partial clones of OAV2 87 genomic DNA were specifically 
modified and initially linked to a gene encoding 

25 antibiotic resistance and origin of replication inserted 
into the unique Sail site of the genome (Figure 6 and see 
Methods). Such a plasmid can be grown in bacteria and 
more easily manipulated. 

The circular genomic clone differs from the 

30 naturally occurring circles that occur in Ad5-infected 

cells (10) and that might exist in OAV287-inf ected cells 
in that the 40 base pair ITRs are joined by a GTAC linker. 
Together with the last and first nucleotides of the genome 
(G and C, respectively, see Figure 1), this sequence forms 

35 a unique Kpnl site (GGTACC) when the ITRs are joined head 
to tail. Other sites such as EcoRI, BamHI , Sail, KasI etc 
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which have recognition sequences beginning with G and 
ending with C are suitable if they are unique as the 3' 
and 5' terminal nucleotides of other adenovirus genomes 
are G and C, respectively. A plasmid with a suitable 
5 antibiotic resistance gene e.g. ampR and origin of 
replication can be inserted at the unique site or 
elsewhere in the genome to form a plasmid which can be 
propagated in bacteria. Plasmids propagated in the 
presence of 200 ng/ml ampicillin in E.coli strains JM109 

10 and DH5-alpha retain the Kpnl sites and inserted 

sequences, indicating that the OAV287 ITR sequences are 
stable when linked in this manner. This approach may 
therefore be used to engineer other adenovirus genomes. 
If desired the GTAC linker sequence can be removed and the 

15 authentic termini regenerated prior to trans fection by 
digestion with Kpnl (or another appropriate enzyme) and 
incubation with T4 DNA polymerase to create blunt ends 

<9)- 

A method for generating linear infectious genomes 
20 from circular plasmids involved digesting the circular 
plasmid containing the full length copy of the OAV2 87 
genome with restriction enzyme Kpnl to generate a genome 
with the authentic 5' nucleotide dCMP. The linear DNA is 
then introduced into CSL503 cells using lipof ectamine as 
25 the trans fee ting reagent. 

To develop a viral genome as a vector it is 
essential to identify region(s) of the genome which are 
non-essential for function. These regions can be then 
substituted or deleted to make room for foreign DNA (11/ 
30 12 ), or they may be the site for insertion of foreign DNA. 

In the human adenovirus genome DNA has been substituted or 
inserted into the El and E3 regions (13, 14, 15) and at 
the extreme right-hand end of the genome between E4 and 
ITR, usually with the concomitant deletion of non- 
35 essential regions to facilitate packaging of the genome 

(16). Adenoviruses will package genomes up to ~6% larger 
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than the wild-type, probably due to physical constraints 

dictated by the capsid structure (11). 

Non-essential sites in the OAV2 87 genome were 

identified by insertion of a polylinker sequence 

5 containing Apal and NotI restriction sites. This linker 

was introduced into the genome copy in pOAVlOO between 

nucleotides 22,139 and 22,130 of Figure 1 by site directed 

mutagenesis to create plasmid pOAV200 (Figure 7). This 

corresponds to a site located in the intergenic region 

10 between genes for the pVIII and fiber proteins which was 

chosen because it avoids disruption of RNA processing 

signals in the region. A transcription termination site 

for the L4 family of RNAs maps 2 6 nucleotides upstream and 

the splice junction between 
15 the tripartite leader sequences and fiber mRNA maps 144 

nucleotides downstream of the insertion site, respectively 

(17). Transfection of pOAV200 into CSL503 cells resulted 

in the rescue of virus OAV200. The second site at which 

the polylinker was inserted was located between bases 

2 0 2 6,645 and 26,64 6 of Figure 1. This created plasmid 

pOAV600 (Figure 7). This insertion site corresponds to 
the right hand end of the A/T-rich region (Figure 2) whose 
function and precise boundaries are unknown. The site was 
chosen as it is six nucleotides to the left of the 

25 transcription termination point for RNAs transcribed from 
right to left from the E3? region (Figure 2). This was 
determined by sequencing cloned RT-PCR-amplif ied cDNAs 
derived from the region using methods similar to those 
described for the pVIII/fiber region (17). Transfection 

30 of pOAV600 into CSL503 cells yielded virus OAV600. 

The above insertion strategy identified two regions 
of the genome which can be interrupted and created sites 
for subcloning gene expression cassettes. 

A further non-essential site was identified using 

35 the unique Sail site located at bases 28644-28649 of 
Figure 1. The site was cut with Sail, end-filled and 
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religated to disrupt the reading frames which spanned the 
site. A plasmid pOAV600S (Figure 7), which had lost the 
site was identified by digestion with Sail. When pOAV600S 
was transfected into CSL503 cells, virus OAV600S was 
5 recovered. The loss of the Sail site in this virus was 

confirmed by digesting the viral genome with Sail. As the 
Sail site falls within two significant open reading frames 
(which extend on the complementary strand between bases 
28457 and 29014 and between 28511 and 28699), which were 
disrupted by end-filling and religation, the gene products 
derived from the reading frames are probably also 
dispensable. This group of reading frames may therefore 
constitute the E3 region of OAV287 as no other gene 
products in any adenovirus are dispensable for 

15 replication, in vitro. This implies that it should be 
possible to delete the whole region labelled as E3? in 
Figure 2. In addition, in other experiments a lkb Ndel 
fragment was deleted from the region marked as E4? in 
Figure 2. This deletion disrupted several reading frames 

20 i n the region. No virus has been rescued from a such a 
plasmid, suggesting that it is not dispensable and 
accordingly, it may be E4 . 

Many viruses replicate incompletely in heterologous 
hosts, often entering cells but being unable to produce 

25 mature virus particles because of a block in the 

replication cycle. In the context of recombinant viral 
vectors, this represents a desirable safety feature, 
provided that replication is not blocked before 
appropriate and effective expression of the foreign gene 

30 occurs. OAV287 does not replicate productively in 

heterologous cell types (18), the only exception so far 
being bovine nasal turbinate cells in which viral titres 
are significantly reduced compared with the CSL503 cells. 
Recombinant forms of OAV287 have been constructed to 

35 determine whether expression of a reporter gene under the 
control of an appropriate promoter occurs. 
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Foreign gene expression requires that the gene be 
functionally linked to a promoter. This may be a viral 
promoter inherent in the genome, or a foreign promoter 
subcloned together with the gene of interest into a 
5 suitable site. The promoter driving gene expression must 
function in CSL503 and preferably a range of other cell 
types. In this work an OAV287 genomic promoter was used 
initially. Subsequently an heterologous promoter was also 
used. In adenoviruses, expression of the structural 

10 proteins is driven by the major late promoter (MLP) . 

Families of RNA transcripts derived from the MLP contain a 
common sequence element, the tripartite leader sequence 
(TLS) at their 5' ends. The present inventors have 
identified those nucleotides in the OAV2 87 genome which 

15 comprise the TLS by using RT-PCR amplification of late 
mRNA transcripts present in OAV2 87-inf ected cells and 
sequencing of cloned cDNAs (17). A candidate MLP was 
expected to be present just to the left of TLS exon 1 
(Figure 2). The MLP and TLS elements were subcloned using 

20 PCR techniques into a separate plasmid pMT (Figure 9) and 
linked with genes of interest. These promoter/gene 
cassettes were subcloned as Apal/NotI fragments into the 
polylinker Apal/NotI sites of pOAV200. Using this 
strategy plasmids pOAV203, pOAV204, pOAV2 05 and pOAV210 

25 were constructed. These incorporate genes encoding a 17kD 
soluble protein from T. colubriformis, a rotavirus VP7sc 
gene (19), the 45W antigen from Taenia ovis (20) and a 
membrane protein (PM95) from Luc ilia cuprina, 
respectively. Plasmid pOAV202, contained the 17kD antigen 

30 but lacked the MLP /TLS elements . These plasmids were 
transfected into CSL503 cells and rescued as viruses 
OAV2 02, OAV203, OAV204, OAV2 05 and OAV210, respectively 
(Figure 10 ) . 

The human cytomegalovirus immediate early IE94 
35 promoter plus enhancer, which functions in a range of 

human and animal cell types (21) , was also linked to the 
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rotavirus VP7sc antigen gene. This cassette was assembled 
by replacing the MLP/TLS elements in pMT/VP7sc with the 
HCMV enhancer-promoter region. The cassette was inserted 
in pOAV200 to create pOAV206. pOAV206 was transfected 
5 into CSL503 cells and virus OAV206 was rescued (Figure 
10). 

CSL503 and other cells were infected with the 
viruses described above and at various times post- 
infection the cells were radiolabelled with 35 S- 

10 methionine. Proteins of interest were recovered from cell 
lysates by immunoprecipitation using an appropriate 
antiserum. Recovered proteins were analysed by 
polyacrylamide gel electrophoresis and detected by 
autoradiography . 

15 When virus OAV202 was used, no expression of the T. 

coulbrlformis 17kD antigen was observed by 
immunofluorescence. As this virus lacks the MLP/TLS 
elements and carries only the 17kD gene this result 
demonstrates that there is no viral promoter upstream or 

2 0 adjacent to the insertion point between the pVIII and 

fiber genes which is capable of driving gene expression. 
As the E3 region is also missing from this site there is 
no requirement for a nearby promoter. This situation 
contrasts with observations made using a human Ad5 E3 

25 recombinant (21). In this case a promoterless gene' 
inserted 3' proximal to the pVIII gene was expressed, 
probably from the adjacent E3 promoter or the upstream MLP 
(15, 21). This result further emphasizes the unique 
nature of the OAV2 87 genome. Recombinant OAV287 viruses 

30 carrying the MLP/TLS elements were tested for expression 
in CSL503 cells. With OAV204, expression was easily 
detected in infected, but not in uninfected cells at 24hr 
post-infection (Figure 11A) . Similarly, when viruses 
OAV205, and OAV210 were tested, gene products of 24kD and 
35 approximately 95kD, respectively were detected (Figure 

11B) . Therefore it is clear that MLP/TLS elements contain 
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the necessary information to drive gene expression in the 
homologous cell line under replication-permissive 
conditions. However, when OAV204 was tested in a 
heterologous rabbit kidney cell line in which the virus 
5 does not replicate productively, no VP7sc expression was 
observed. Some replication occurs in bovine nasal 
turbinate cells, although to a lower titre than in CSL503 
cells. In the latter cells, expression of VP7sc was 
detected following infection with OAV2 04 (Figure 11B) . 

10 Virus OAV206 containing the HCMV enhancer/promoter 

element linked to the VP7sc gene was used to examine the 
function of a heterologous promoter in the context of the 
OAV2 87 genome. CSL503 cells infected with this virus 
readily expressed VP7sc antigen at 24-48hr post infection 

15 (Figure 12A) . With this virus VP7sc expression was also 

observed in the non-permissive rabbit kidney cell line and 
in bovine nasal turbinate cells (Figure 12B) . These 
results suggest that the HCMV or a similar constitutive 
promoter may be preferred over the MLP to drive gene 

20 expression in OAV recombinants in non-permissive cells • 
One recombinant virus was also administered to 
sheep. Five sheep were vaccinated intracon junctivally and 
intranasally with 0.7xl0 8 pfu of OAV203. At three days 
post-inoculation virus was recovered from the nasal swab 

25 of one sheep and from the conjunctival swabs of two sheep 
and confirmed as the recombinant virus by PCR analysis. 
Animals showed no obvious ill effects from such 
vaccination. 

The viral vectors. of the present invention can be 
30 used for the delivery and expression of therapeutic genes 
in grazing animals. In species which are not normally 
infected by ovine adenoviruses the lack of pre-existing 
immunity should allow efficient infection, gene delivery 
and expression. The genes may encode vaccine antigens, 
35 molecules which promote growth in production animals, 

molecules which modify production traits by manipulating 
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hormone responses and other biologically active or 
therapeutic molecules . The virus does not replicate 
productively in many non-ovine cells but the use of 
heterologous promoters allows the delivery and expression 
5 of genes while minimising the possibility of virus spread 
to a non-target host. As the DNA of adenovirus vectors 
can persist in cells in an unintegrated form, with the 
appropriate choice of promoter, expression over a 
prolonged period can be achieved, 
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CLAIMS : 

1. An isolated DNA molecule comprising a nucleotide 
sequence encoding the genome of ovine adenovirus (OAV287) 
substantially as shown in Figure 1 or a functionally 

5 equivalent nucleic acid sequence. 

2. The DNA molecule as claimed in claim 1 such that the 
nucleic acid sequence encoding the genome of the ovine 
adenovirus is substantially as shown in Figure 1. 

3. An isolated DNA molecule comprising a nucleic acid 
10 sequence encoding the genome of ovine adenovirus (OAV287) 

substantially as shown in Figure 1 wherein a portion of 
the adenoviral genome not essential for the maintenance or 
viability of the native adenovirus is deleted or altered. 

4. An isolated DNA molecule comprising at least a 15 
15 nucleic acid base sequence being substantially unique to 

the ovine adenovirus (OAV287) nucleic acid sequence as 
shown in Figure 1. 

5. The DNA molecule as claimed in claim 4 such that the 
at least 15 nucleic acid base sequence encodes a 

20 functional element of ovine adenovirus (OAV287). 

6 . The DNA molecule as claimed in claim 5 such that the 
functional element is selected from the group consisting 
of promoter, gene, inverted terminal repeat, viral 
packaging signal and RNA processing signal. 

25 7. The DNA molecule as claimed in claim 6 such that the 
functional element is the inverted terminal repeat having 
the nucleic acid base sequence 1 to 46 as shown in Figure 
1. 

8. A plasmid including the DNA molecule as claimed in 
30 any one of claims 1 to 7 . 

9. A plasmid including the DNA molecule as claimed in 
any one of claims 1 to 3 such that the nucleic acid 
sequence encoding the adenovirus genome or a portion 
thereof is linked to a nucleic acid sequence encoding an 

35 origin of replication and a further nucleic acid sequence 
encoding a marker. 



WO 96/03508 



PCT/AU95/00453 



27 

10. The plasmid as claimed in claim 9 such that nucleic 
acid sequences encoding inverted terminal repeats of the 
adenovirus are joined. 

11. The plasmid as claimed in claim 9 or 10 such that 

5 the nucleic acid sequence encoding the marker encodes for 
resistance to an antimicrobial agent. 

12. A viral vector comprising a DNA molecule including a 
nucleic acid sequence encoding the genome of ovine 
adenovirus (OAV2 87) substantially as shown in Figure 1 or 

10 a functionally equivalent nucleic acid sequence or a 

portion thereof and at least one nucleic acid sequence 
encoding a non-adenoviral polypeptide or polypeptides . 

13. The viral vector as claimed in claim 12 such that 
the nucleic acid sequence encoding the genome of the 

15 adenovirus is substantially as shown in Figure 1. 

14. A viral vector comprising a DNA molecule including a 
nucleic acid sequence encoding the genome of ovine 
adenovirus (OAV287) substantially as shown in Figure 1 
wherein a portion of the adenoviral genome not essential 

20 for the maintenance or viability of the native adenovirus 
is deleted or altered, and at least one nucleic acid 
sequence encoding a non-adenoviral polypeptide or 
polypeptides . 

15. The viral vector as claimed in any one of claims 12 
25 to 14 such that the nucleic acid sequence encoding the 

polypeptide or polypeptides encodes a polypeptide or 
polypeptides derived from bacteria, viruses, parasites or 
eukaryotes . 

16. The viral vector as claimed in claim 15 such that 

30 non-adenoviral polypeptide is rotavirus VP7sc antigen, the 
parasite polypeptide is Trichostrongylus colubrlfonals 
17kD antigen, the Taenia ovls 45W antigen or the PM95 
antigen from Lucilla cuprina . 

17. A method of delivering a DNA molecule having a 
35 nucleic acid sequence encoding a non-adenoviral 

polypeptide or polypeptides to a target cell, the method 
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comprising infecting . the target cell with a viral vector 
as claimed in any one of claims 12 to 16 such that the DNA 
molecule encoding the polypeptide or polypeptides is 
expressed and the polypeptide or polypeptides is produced 
5 by the target cell. 

18- A method for delivering a DNA molecule having a 
nucleic acid sequence encoding a non-adenoviral 
polypeptide or polypeptides to an animal, the method 
comprising administering to the animal a viral vector as 

10 claimed in any one of claims 12 to 16 , such that the viral 
vector infects at least one cell of the animal and the 
infected cell expresses the DNA molecule encoding the 
polypeptide or polypeptides and produces the polypeptide 
or polypeptides . 

15 19. The method as claimed in claim 18 such that the 
animal is a grazing animal. 

20. The method as claimed in claim 19 such that the 
grazing animal is a sheep. 

21. A viral vector comprising a DNA molecule including a 
20 nucleic acid sequence encoding the genome of ovine 

adenovirus (OAV28 7) substantially as shown in Figure 1 or 
a functionally equivalent nucleic acid sequence or a 
portion thereof and at least one nucleic acid sequence 
encoding a functional RNA molecule. 
25 22. The viral vector as claimed in claim 21 such that 
the functional RNA molecule is an antisense RNA molecule 
or ribozyme. 

23. A method for delivering a DNA molecule having a 
nucleic acid sequence encoding a functional RNA molecule 

30 to an animal, the method comprising administering to the 
animal a viral vector as claimed in claim 21 or 22, such 
that the viral vector infects at least one cell of the 
animal and the infected cell expresses the DNA molecule 
encoding the functional RNA molecule and produces the RNA 

35 molecule. 
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Fig 1 

CTATTCATAT ATATAACGTT GCACAGAGGC GGGGCGTGTG GGTTTTTTAT TGTTTATTGT 60 

CATGGAATTT ACAAAGAAGT AAGTTGTTGG ATCTTTATTC ACAATTCTTT TAACAATGAC 120 

TTTTTTACTT ATTACATTTT TCATCTTTTT TACTTCACAT GATATTTTAC TTAAATTTTG 180 

TACATACAAG CCAAAATTCG CATAAAATGT CTTACTTTAA AAAGTTAAAT TTTTTTTTTA 2 AO 

ACGCATAAAT GGACGTACAG CAGCAATTGG AATAGCAGGA AGGGCCATTG TAAAGTGTGT 300 

TCCTGCTGAT GCCGCTGCAG AAAGGATAGA TGCTATCGTA CGCATAAACC CCCCTCCTAT 360 

TTGTTCATCT GCTGCTTTTA TTATATCTTC TGCCAATCTA GGTGATATTT GCTTTTGAAT 420 

GCTGTTTCCA AAAGCTTGCA TCATCGGATT TTCAATTAAA TGGATTGGAT TTGCAGAATT 480 

TCCTTAAAAA TAGCCCAACC CATCTAAAGC AGTTAAAAGT ATTCTCCCTC CAGGAACCAC 540 

AGATATAATT AAGCGGAGCA ACCGAGAGGT TAAATTCCAG GGTCCTCCGA AGAGAGTATC 600 

TAGGATCAGG CCAAGAAGTG AACCAAAAAG ACTTGTAAGT AGAAGTTGTC TGATATGCTT 660 

TGGAGAGGAC TGTTAAAATT GCAAAACGGT ATCTAATGAC CATTTCTTCT TTACTTTTAC 720 

ATCTGTATCA TGTTCTCCAT CAGAAGGTCT TATTGGGAAG TACCATTGGT CAC GAG CATC 780 

TTTGAAGACT TCTGTTTCTT GAAATTCTGT TTTCGGTAAG CGACTAGCAG TTATGGTATT 840 

AGGAATATTG ACGGTAATGT TATTCACATC TACAATTTCT GGAGGAATCC ATCTTGCATA 900 

GGATGAAATG GGTTTTGTGG GTTCTTTCAA TATATAATTG CGAGGAGGGT TTTTCCAAAA 960 

TCTCTGAACA TAAGTATTTT CTGATTTTGG CGGTTTTTTG CTTTTTCGCG CTCTTTTTCT 1020 

TGGCTTTGGT CTTTGAAATT TTTTCTTCCT TTTTCTGTAG GCTCCTCCTG CTAAAGCTGT 1080 

GTTATTTGTG ACGTACATCC TGTTAGCTAC ACGATTTTCC CGGACTGCAA ATTTTTTTGC 1140 

CAAATGGAAA AGAAATTGCT GAAACCTTCT ATTAATCATA TAAATTGTCA GTGGAATCAT 1200 

GAAT CAGATA GTGCAGGATT TTTTCTTTTT GATACTGATA ATTTATACTA TTATGTATTG 1260 

GATCAAGTGT CTTGGATATG TTTAAGAGAT ATAACTCTTC ATTGTGATCG CATGTGGTTA 1320 

GCGGTTTGTT TTTGTTTGTG CAAATCTAAA TTTGATGTAC ACAATATTCT AGCGGGAGTA 1380 

CATGTTATGT AATGAAAATG ACGTCGGGGA TTGAATGGAT TGAGCCTTAT TTGACATTTT 1440 

TCTGTGATTT TTTTGCCTTA TTAGGAAATA AATTTGTGGC GCCAGTACGA TGGAGATTGG 1500 

AATGACTCCT GCATTTACAG AAAGGAATTT GTACTGTGTT TTGCTTGACT TTAATTTAAG 1560 

ATGGTATCAG CAGATATTTA ACCCAATATG GATTAAGCCA AATTTATGGG CTTTCTCTGA 1620 

TTTTTTAAAA AAAATGGCCT TTATTTATGC TAGCGACTTG GCGTTGTTAA ATTCTTACAT 1680 

CCCTGGTAAT GTTTGTAACA AACTTGATAT CATCAAGAAA GATCTTCCTG AAGATTTTAC 174 0 

CGTGTCTATG TTTTGTGTCT TAGTGTGTTG GCTTGCTTCT TTCTGTAAAG GTTCTAATTT 1800 

AGCTGAAACT CGCCAGAATT GTCACGCGGT AAGCAAATTT CTGGCACAAC TATCAAAATT 1860 

AATAAAACCC TAATTTTTAG TTTGTAAAAA TAGAATTCAA ATTTTTAACG CCACAATGAC 1920 

TTCGGCGGAG TTTTCTGTTG AATTTCCTTA TGTTTCTAAG CCAATTGTTC CATGGCCTGC 1980 

TTCGGCATCT TCTAATAATT CATCGAGTCA GAATATTGAC TTTCCTGTTC TTAAACCAGA 2040 

TCAAGATCCA ATAGCCTTCT TTCAAACTAA CAATACGGCT TACTTACAAC CTGGAGCTAC 2100 

TTATTACTGG AAGTGTATCG AACTGTCAAA GCCTATTCAC ATTTACGGTC AAGGAGCTAC 2160 

AGTACAACTT GTCGGACCTG GACCTGTGTT TGTTTTCAAC AGTGAAAGTG TTATTCCTGA 2220 

AGATTTTTAC GTCGTGTTTG AAAATATCAA CTTTATTGAA GATGAATTTC CTATTAGAAG 2280 

TGGCCAGTTA AGTTTAGGAC TTACAACTCA CAGTGCTGTA TGGTTTATCA ATGTATGGAA 2340 

AACTTCAATA GTCAATTGTA ACTTTAAAAA TTTTAGGGGA GCGGCTCTTT GGTATTCAGA 2400 

TAATAGAAAT TTTTGGAATG CGAGAAAATG GAATCAGCAG CATTTAGTTT CAAATTGTCG 2460 

TTTTAATGGT TGTAGAATTG GAATTTCTAA TACTGGTTCA TCTGAATATT CCATAGCCAG 2520 

TCAAAATCAA TTTTATGATT GTCAAATCTG TTTTAATGTA ACCGGGGGTA ATTGGTCTAG 2580 

AAATAATAAT GTTATTGTTA ACTGTAGATG TGCTTATCTG CATGTTGGAG ATAACATGTG 2640 

GTATGAAGGC CATTCCGAAA ATAATAATCC CGCTAAGGGT ACTTTCTGCA ATAACATAAT 2700 

TAACCATGCT GATAACGGAG GCAATGTCTG GCCTACTCAG TTTAAACTTA CAGATGGATC 2760 

AACGATACAG TTAGCATCAT TTTATTTTGA TGATAATCAA GAAATTCCAC CTTGTTATAG 2820 

CGGTAATTTT CATTGGTTTG GAGATGTAAA CATTGTAAAT TTTTCTACCA CAAAAATTGA 2880 

TAAATGGTGC ATTACTGGAT GTAATTTCTA TGGTAATACA CATGCAGCTA ACGATGCTGG 2940 

TCAAGTTCAG GTTGCTGAAG CTGTAAAAGA CAAAGTGTTT ATTATTGGGT GTTCTGGTAA 3000 

TAATGTAACC ATGAAAAATA TTGTAGAAGG TAACATGACT CCAAAAATTG GTACAATAAA 3060 

GTAAAAAACT TTTTATTCAA AACAAAATGG ATTTACATTT AAACGTTTTA CATATTGATT 3120 

CTGCGTATAA GTTCTTTTTC TAAACACTCT TCTAATTTCC ATACATGCTT GATAAAACAA 3180 
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Fig 1 (cont) 

ACTTTGTAAA TTCATAAATA TAGGTTTGAC TTGATCAGAA GGTGAATAAT AGCTCCATCT 3240 

AAATGATTCG GTAATAGGAA CATTATTATA TATTAACCAG CTATATTTTG AGTTAACTCT 3300 

TGCATGATCC ACTATATCTT TAAGTACAGG GATAAGTGCA CTCGGAAATC CAAAAGAATA 3360 

GTTTTTAATA AATCTATTTA TCTGTGAAGA ATCAAGCTGC GGACTAATAA CATGACATTT 3420 

TGATTGAATT TTTAAATCCT TAATATTTCC TCTATCATGA CGCGGGTTCA TATTATGTAA 3480 

AACTACTACA ACAGTGTAAC CATTACATTT GGCAAATCTA TTAAAAATTT TTGACGGTAA 3540 

AGCATGAAAG AAAGAACTTA TAGAATGACA TGATCCCAAT TGATTCATAC ATTCATCTAT 3600 

TATAATACAG ATAGATCCTT CACTTGCAGC TCTGCAGAAT ATATTATCTG GATTATCAAT 3660 

ATTTAGATTA GTATCGGAAA TAGCATCTTT GAAAGCTAAT TGTATAAATT TTGGATTTAA 3720 

TGTTTTTGTT AGTGGATTAG AGAATGCATC GTAGTTTCCT TCAACACACT GTGCTTTCCA 3 780 

CGCAATTTTT TCTTCTAATG GAACAGTACC TTTTTCTGGA GTTATGAAAA AAATTGTTTC 3840 

TGGTATTGGA TCAATTAGTT TTCCAGATAT AATATTTCTT ATAAATTGAG ATTTTCCGCT 3 900 

ACCTGTGGGT CCATATACAG TAACAATGAA TGGTTGTAAT CCGCAGTTTA AACTGGGTAT 3960 

ACAGCCATCT TTTAACAGAT TGTGAGCCTC ATTTACAGTT TTTTGATAAT TTACAGCAAT 4020 

ATTGTGTAAA TCAGTCATAA GTTGACCATG ATACATACAT TTATCAAAAA CTTCTTGACT 4080 

TTCTGGAAAT GGATTTCTGC AAATAGAAGG ATCTATCTTT ACAACATCAT TTTTCCAATT 4140 

TAATGTGTCA CTTAAAAATT TTCCCAAAAA GGATTTTCTG TCAATGGTTC TTGCGGTCTT 4200 

GGATTTGGGT GTCTCTTGTC GTACGGGTAA AGTAAGTATC CTTTCTTCCA CTGGATCCCT 4260 

TTCCTCATCG TTTGATCCTT CCAAGGTCTC AGAATTCTGG TTAGTTGCTT CTCTACCACC 4320 

GTGAATGGTA CATCGGTTCC ACTTGCGGTT TGCAGTGTCT TTTTTAAACT TTTCCTCGAT 4380 

GTCTGAAACT CTTTCTGTGG TTGTTCTAAT AAATTATAGT CAGTAAAACA ATGTTTTAGA 4440 

ATTTCATAGT TTAAACAATT TTTAGCATGA CCTTTGGCTC TTAATTTTCC TTCTCCAATA 4500 

AATTTACAGT TTTTACAAGT TATGTCTTTT AAAGCATATA ATTTAGGAGC TAAAATACAT 4560 

GTTTCTGAAC TGAATGCTTC AGCTCCGCAA CGGTTACAAA CAGTTTCGCA TTCAACCAAC 4620 

CAAGTTAGAC ATGGATGTTT TTCATCAAAG ATTAAATTTG AGTTATATTT TTTAAGTCTA 4680 

TGTAATCCTT TTGATAACAT GAGTTGGTGG CCCTTTTCTG TTAAGAATAA CGAGTCTGTA 4740 

TC AC CAT AAA TACTTTTTAT CTCCCTTTCT ATGTAAGGTT TACCCATATC TTCCCCATAT 4800 

AAAATTTCTG CCCACTCACT CATGAAAGCT CTGGTCCAAG CCAGCACAAA GGATGCTATC 4860 

TGAGTTGGAT ATCGGTTGTT CTTGATCCAT TCTTCCTTAT CCTCAATAGT TGTTAAAATT 4920 

AAATCATTAC AATCAGCAGA TAAAAAAGTT ATAGGCTTAA AAGTCACGTG ATCTTGATTT 4980 

CCTATAAAAA GTGGAAAATT AAAATTTTCA TTTGTGTCTT TGGAATCTTT GGGCGGCATT 5040 

TCAGGTAGGT TTGAAAAATA CTGATTCCAC TCAAATGAAC GTTTTGGTAA TGATTTACTA 5100 

ATCACAGTTG TGTATGATGT AATTTCAGCT GATCCATTTT CTAATCTTTT TTTATCTTTC 5160 

TCTTCAATAT TTTCAGCAAA CACTACTTTC TTTTTATCTA TACGGGTAGC AAACGAACCA 5220 

TATAAAGCAT TTGATAACAA TT TACT TATA CTTCGCTGAA TCTTGTTGTT ACTTTTACTT .5280 

GCTTTTTCTT TAGCCATAAT ATTTACTTTC ACATATTTTT GACATAACGG TTTCCAGTCA 5340 

CTCCATACAG CATACATTTC AGAGCTTTTG ATTATTTTGC ATTTCCATCC TCTATTGTGT 5400 

AAGGTGATTA AATCGATAGA GGTCAGTACT TCATTTATCA ATGTTTCATT TGACCAGCAT 5460 

AACTTTCCAC TTTTTTTAGA ACATAATGGA GGTAACACAT CAAGATAATC TAATGATGGG 5520 

GGTTCACAAT CGGCTACCAC AATCATAGGT TTGATTGAAT TGTCAAAATA ATCTATTTTT 5580 

TCTTTTCTTT GTAGTAGTTC TTGAAAGTAA TCTATTTGTG CATTGGCTTC AAAAGCATTT 5640 

AAAGTTTTTC CATATGGAAG TGGATGCGTT AAGGCACTAG CATACATTCC GCAGATATCA 5700 

TACACATATA TTGCTTCTTC AAATATTCCT AAAAATGAAG GATAACATCT TCCTCCTCTT 5760 

AAACTCATTC TAACAAAATC ATACATTTTT TCTGATGGAG CTTCCAAATT TCTTAGGAAT 5820 

TCAGAGGGAT GATCTTCTTC ATTATAAAAG ATTTGTTTAA ACAATGCTTG AG TAT T ACTA 5880 

CTAATTGTAG GACGTTGGAA TATATTAAAA GAACACTCAA GCTTTAAAGA TGTTGTACAG 5940 

AACTCTTGAT AACCTTCTAT AAGTTTTTCA ACTAATTGAG CCGTAACTAT AACATCATCA 6 00 0 

ATACAATACT CCTTAGCTTC CTCTAATAAG TTGTATTTTT GGTTGTGTTT TGGTTTGTTT 6060 

TGTAAATATT CTTCAAATGA ATTCCAATAT TTTTGAACTG GATAACCATT GTTTTCTTTT 6120 

TCATATTCTC CCAACATAAA AAAATCATTG ATTGCCCTGT AAGGACAATA ACCTTTGCTA 618 0 

ACACTCAACT GATATGCAGT AGCAGCGTCT CTTAAAGAAG AGTGGGTTAA CAAAAATGTA 6240 

TCCCTAACCA TAAATTTTAT ACCTTGCCAT TTCATATCTT CAAAATTAAT AATTCCATTT 6300 

TTCCATCTTT CATAAGTTGT ATGTGAAGGT TTCTTAAAGC AAGGATTTGG AAGAGATAAT 6360 
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Fig 1 (cont) 

GTAATATCAT TAAATAACAG TTTTCCAGCA CGAGGCATAA AGCTTCTTGT CAGCTTAAAC 6420 

ATTGAAAGTT CTTCACTGTC TATTCCTTCT AATACATGAC TTGCAAGTAT GATTTCATCA 6480 

AAACCACAGA TATTATGACC TACTACATAT AATTCAATAT ATCTTGGTTC GCACTGTTTT 6540 

AATTTTTTTT CTTTATTTAA GACCATGATG TCTTCATATG ATAAATTTGA TTCAAGACCA 6600 

TGATTTTCAC AAAACGTTGA CCAGTATTTT TTAGCTACTG AAATTTGTAG CTCTGTTCTG 6660 

AATTTTTTAA AAGCTATGCC AATTTCATCT TCTTTTTTAT TTAACATTAC AAAACATTCT 6720 

CTGTTTACCT CATAACCTAT ATCGGTAGCT ATTTTAGAAG CAATTTTTAT GAGTGATTTA 6780 

CATCCAATTA ACTTAAAAAC CAACAAGTAA GGAGTTAACT GTTTTCCATA CAAAGAATGG 6840 

TAAGTATATG TTTCAATATC ATAAACAATA AAAAGACGTT TTGCTTTTAT GGCTCCAACT 6900 

GGATTAAATT TGATTTTTTC CCACCAGAGT TTTGTTTCAT GGTGAATATT GTGATAATAG 6960 

AAGTCCCGTC TTCTGGATGA GCAGTTGTGT ATATTACTAT AAATTGTTCC GCAGAATTCA 7020 

CATTTATTCT GTTGTTTAAC AGTTTTTATT AAATATATTT CTCCTTTTAA AATCAATAAT 7080 

TCTATTGGTA ACAAATTTCC ATTAAGAATT TCTTCAGTCA TCTTAAAAAA TCTTTTGTTG 7140 

AACTTCCATA TTTTTAAAGA TACGGGGGTG TTAGAATCAC AAAGTTTTAA AACATCTAAA 7200 

ACATTTTCTA CTTTCTTGAA AGAATTTAAT TTTAAACCCT GAATTGCAAA GTAATTATAA 7260 

AAACTTTTTT CAAAATTCTT GTAGTATATA ATTTTTATAT ATGTATCCTC ATATATTCCA 7320 

GTAATATAAG TAGTAGTTCT TTGCTTTATT ATTGTCTTTG AAGCCATCTG TTTAAAGCCG 7380 

CTTCCCGTAC TCGCTCAAAG CTTCTTAAAA CAACTTCATT TGTACTATAG CCAACAATTC 7440 

CAGACAATTT TATTCTAAAT GCTATTTCAA CTGAATCTAA ATCTGAAAAA TCCGTGTTTA 7500 

CTTGGTTGAT TACTTCTTCT ATGCTCCCAC TGTCTTCTAC GAAGTCTATA TCTTGAAGTA 7560 

ATTGGTCTCT TTCTTCTGGA GTTGAAAAAG AGTAAGATCT TTCATTAGCT TCTATAATTC 7620 

CTAAAAAATC ACGAGTTATT CTGCTATATA GTTGTCTGAA TGCTTGTGTT TCTCTATTAA 7680 

ACCAAACTCT AGTAAATATA TCTTCTCCAT TTTCATTTCT ACCTCTTAAT ATAATTTGAA 7740 

CAAATTGGAT TCCAATATTT CTGGCAGCTA ACCTATTTTG CACTAAATTT AAGTATAAGT 7800 

AATATAGCGT GCTTGCCACA TGCTCTAATA TAAAGAAATA CACTAACCAT TTTTGAATAA 7860 

AATCATCAGT CAATCTATTT TCATTATAAA ATCTAATAAG TAATTGAAAA AATTCACTTC 7920 

CGTAATTAAA AAAATTACTC CTTCTTGCTT CAGGAGTTAA TTCTTCTTCT AAATTTTGAA 7980 

TTAAATCTAC TATTGAAGCT ATCACTTCAT CATTAAATTC TTCCCTACTC AGATCGCTTG 8040 

AGCTCGGCTC GCGATCTGAA AATCCTTCAT CTTCTATTTC AGGAACAGTA AGAGGAGAAC 8100 

TAGAAGTTTC TTCAACATTC CTTACCCTTT GGCGTCTATT AACAGGTAAT CTATCAATAA 8160 

ATCTTCTGAT TACATCACCC CTTGAACGTC TCATTATTTC AGTAATAGCT CTATAATTTT 8220 

CCCTAGGTCT TAATCTGAAT GGTAATCCTA CTCTTGTCCC TGACCTTAAA GTTAATGCTC 8280 

CACCATGCAT CCCACCTTTT CCTAAAGTTA ATACAGTTGC TAAATCTTTT AAATTAATTC 83 40 

GATTTTCAGC TTCTGGAATT TCCAGCTGTG AAAATTCATC TATAAAAAGC TCAATCCAGA 8400 

ATTCAGAAAA AGGTAAGTCT AATATACATT CACTATTATG CATGTTAGAC AAAATTAAAA 8460 

ATTTACATAA AGCTTTTTTA ATTTTACAAA TTAACTTTAT AAGGTAAGTA TCCCTTTCTT 8520 

GCAAATTTAA AACCATAAAA GCTTGAGAAA AAGGTTGATA ATGCTGCTGA AAAGATCTAT 8580 

TCTGATTTTG AGCTGAAATA GCGGAGCCAA AACCTTGCAT GTCTGCAAGT TGCAGACTCC 8640 

CTAATATTCT ATCCATTAAA ACCGCGTTTT GAATTTGACT AATTGTTTGT GAAAAATTTT 8700 

CTACATTTTG AATTGCTCTC ATATATGACC CAGTATTTAT GGAGTATGAA CAATCAGTTA 8760 

AAATTTGCCA GGTCATGCGT CTCTCAAAAC TTATAGGTGA AAGATACAAC TTATATGAAA 8820 

TGTTGCTGTA AGTCCGCTGA TCAAACAGAT ACTGGTTTAA AACTCGCGCC ACATAAAAAT 8880 

ACCCAATTAA TAAATTTGGT GGAGGTTCTC CTTCAAATGG TGGTTGTGAA GTAACAGGTC 8940 

CTCTTGGGCG TAAATCGAGT AATTGAGTCA CTGGATAATT AAAAAATCGA TTAGCCCATT 9000 

TTATTCCCCT TTCATGTATA GTCCTTGACC TGGCAATACT TCGATTATTA AGGTCAAGTG 9060 

TTAAACGTAA ATATCGTAAG GTATGTTGAC TTTGCCCAGT GAGTTGTTGC CATTGGTGAA 9120 

TCTGCAAGGC AAACAAAAAA TTTATCTTAT TACTGCAGAT GCATCCTATT TTACAAAATT 9180 

TACGTTCATC ATTGGAAACT CCAGACTTAT CAAGCAACTC CCCGGGCACG TCAAATAAAA 9240 

ATGAAAAAGA TGAATTTGAA CCAGCAGTTG GCATTTCTAG CAAACCATCT GATGAATTTA 93 00 

AT AT GAG AC G ATCTCAAAGA GATGATAATT TACCTAAAAG TCAGATACCA GTAGTAGATA 93 60 

TACTACATGA TAAAAATCCT AAAATGGCAG AAGAACGAGA CTTAATGTAT AAATCTTCTG 9420 

CTTGCATAAA ACTTGATGAT TCTAAACAAT TAAAAACTGA TATGTTCAGG CCGGATTTTG 9480 

CTGGAACTAG TCCAGCTCAA AGACACATAG AAGCCGCAGA GCTAAAGAGA AATGGATCTT 9540 
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Fig 1 (cont) 

ATACTCGTAG TTTAGAACAA TGGACACATG ATTCTTTTAT AAGTCATGTT AAACAATTAC 9600 
TTTCTAGACC ATTTATATCT CTAGGTATTA CATATTTGGA TGATTTTTTG CAGACTTATT 9660 
TAGATCATAC TGAATCGTCT TCTTTAAACT TTCAACTGTT TACTTTAATA AATCACTGTT 9720 
CAGAAAATAC TTTAAAACGG ATTTTAAAAC ACATTTCTAA AAAAAATGAA AAAAATCAAT 9780 
ATGTAAATCA ATGGTTGATT GATCTCATTA CAT GT AT AT A TCTAATTATA AGAGATGAAC 9840 
AAAATGTTAC AGAACAAGTT AATGCCCTTT TAGTAACTAG TAATCACTTA GCTTTACATT 9900 
TTGCAAAGAA AGCTACAGGT GGATTCTATC CTACAGCAGA CAAGTTAGCG AAGACTCATA 9960 
TTTTTTTCAA GAGAATAATT TTAGGAATAC TTTCGCTAGC AGAAAGTATA GGTTGCTATA 10020 
CTGTGAATCC ATATTGCAAA AATCCTTTGA AAAAGTCAAA AGTAGAAGTA GAACCAAGTG 10080 
ACGAAATGTA TATGTTCAGC TTAAAAGGTG CACTTGAACA TCCTGATTCC GACGAAGACG 10140 
AAGACAGTGG ACTTCAAAAT GAATAATTAT CATAAATGGA CTTCTAATGT TAT AG AT GC A 10200 
ATTCTATCAA ACAAAGCTCT TTTAGCTATA AAAATTTTAA AAGTCAACCG TTTGCAAACA 10260 
AATTGAATGC TTTAGAATCA GCAGTTGTGC CTCCAAGAAA AGATGATACT CCTGAAATGA 10320 
TAGCAAATCT TTTAAAAGAA TTAGTTGCTT TGGGAGCTAT TCGCAGTGAT GAAGTTGGCC 10380 
CATTATATTC TGACCTTCTT ATCAGAGTTC ACAAATATAA TAGCTTGAAT GTTCAATCAA 10440 
ATTTGCAAAC TTTAACAGGA GACATTAAAT CACTTCAATC CGATATAATT AGAAGTTCCG 10500 
ATATTCCCAA TTTAAGTAAT CAAGTTGTTT TAAATACATT TTTAAATTCT TTGCCCTCAA 10560 
CTGTTACATT TGGACAACAT AATTATGAAG CTTTTAAACA AACTCTAAGA TTATTTGTTA 10620 
ATGAGACACC TAATATTACA GTTTTTAGAT CAGGAAATGA TACTTTAATT CAGGTTAACA 10680 
TAACAGGAAT TCATACAATT AATTTGAATG ATGCATTTAA AAATTTAAAA AATTTTTGGG 10740 
GAATAGTATT AACAGGTGAA TTTATTCCAG GT GAT ATT AC AAGCAGACTA ACAGCTAATA 10800 
CAAGAGTACT GCTTTATTTT CTTGCTCCTT TTACAAATGA TAATACATTC ACACCTGATA 10860 
CTTTTCTAGC TTTACTCATG AAATTATATA GATTGACAGT TTCTTCTGCT TTAGATTTTG 10920 
AAGAAGAAAC TGAAGCTGAA GTAGAAAATG TAGCTCAACA AATAGGATCC ACTAGTGCAG 10980 
ATTTTACAAA GACTTTAGGA TATCTATTAA AAAACAAAGA AGAATCATTT TCGCCTCCCA 11040 
AATCATTATC TCCTAGACAA CTGGGTATTT TAAGGTTCAT ACAGAAAAGT CTGGTAGATA 11100 
AAATTGATAG AAATAATGAA GATCCATGGG ATGCTTTAGA AACTTTATCT TATTCATTTT 11160 
CTCCGTCATT TTATGAGGCC AATGGGCCTT TTATTAGACG GTTAATAACT TATATGGAAT 11220 
TTGCCTTACG TAATTCTCCT ACTTACTTCA GAGAAATTTA CTCCAAGAAA TATTGGATAC 11280 
CACCCAATTC ATTTTGGACT CAAAATTATG CAGACTTTTT TTCGGAAAAG AAAGAAAAAC 113 40 
AAAATTTCGA AACATTTGAA CCGCGGGAAC TTCCTTTACA AATCTCTGAG GAAGAAGCTG 11400 
TCCCGCATAC AGAAGATTTT CAGTCAGCCA TCTCGCCCTC TATGGGCCAA ACTTCACTCC 11460 
CTGCTCCTTC TGTGTCAGAA TACAGTAGCG TGCCTCGGTC AGCTTTTTAC CCTCTCAGAG 11520 
AACGTATCCA AGAGAGCATT TCAAAGGCAG TCATCCCTCC TTTGACAGGC TATGTCGGAA 11580 
AACAAATAGG TGAAACTATT TTCCCTGGTA GTGGAGATCT TGTAGCACCC GCTGCGTCTT 11640 
TAGTTGCAGC ACAATTGGTT GATTCAAGGT TTAATAACAG AAGACAAAGA TTGAAAGACG 11700 
CAGCCAGAAA GCGTCACCGC TATGTTAGAG AGATGCATAA TATTTCTGAT AAAGAGTCAA 11760 
ATGCTTCTAA TGATACGGTA ATATCACCTT TGATTGGACA TGGTTCGCGC ACTGAAAATC 11820 
GTTTTGAATA TTTGAGACCT AAAGGTGGAA ATTATTTATA CTAATAAAAA TCATAACAGA 11880 
CCTGACGGGC GGTCATCCTT TTTTATTAGA TGCAGAAATT TGTACCTCCA CCACGAATCC 11940 
TTGCTCCAAC AGAGGGTAGA AACAGTATTA CTTATACGCC TCTGGCACCA CTGCAAGATA 12000 
CAACAAAAGT ATTCTTTATT GACAATAAGT CTTCGGACAT TGAAAGTTTA AACTTTACTA 12060 
ATAATCACAG TAACTTTTTT ACAAATATTA TTCAAAATGC TGATTTGGCA GCGGATGAAG 12120 
CAGCAACGCA AGATATTAAA CTGGATGAAA GATCTAGATG GGGCGGTGAA CTGAAAACTT 12180 
TTATAAAAAC AAATTGCCCC AATGTTTCAG AATTTTTTAA CAGTAATAGC TTTCTAGCCA 12240 
GATTAATGGT AGATAAAACT GATCCAGAAC ATCCTAAATA CGAATGGGTA CAAATTACAA 12300 
TTCCTGAAGG CAATTACACT GGAAGCGAAC TTATAGATCA ACTTAACAAT GGTATTTTAA 12360 
ACAATTACTT AGAAGTGGGA CGCCAAAAAG GAGTAGAAAT TGAAGACATA GGAGTAAAAT 12420 
TTGATACAAG AGATTTTTCA CTTGGATATG ATCCTGAAAC GGGACTAATT ACTCCAGGAA 12480 
AATATACATA TAAAGCTTTT CATCCAGATA TTATCTTGCT ACCTGAATGT GGCGTAGATT 12540 
TTACATATTC TAGAATTAAT AATATGTTAG GTATAAGAAA GAGATTTCCA TATACTAAAG 12600 
GATTTCAAAT TTTATACAGT GATTTGACGA AGGGAAATAT CTCTCCATTA CTGAATTTAA 12660 
ATAACTATCC TCATTCTATC GAACCTGTAA TGCAAGACGA AAATGGAGTT AGCTATAATG 12720 
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Fig 1 (cont) 

TAGAAAAAAT AAGTGACAAT CCCCCCAGAT GGCAAACAAA GTACAGATCT TGGACTTTAA 12780 

GTTATAAAAA TAATGGAGGA GCTAAAGCCC TAACTGTACT AACTGTTCCG GACATAACAG 12840 

GAGGATTAGG TCAAATTTAT TGGTCAATGC CAGATACTTT TAAAGCACCT ATTACTTTTA 12900 

CTAACAATAC TACAAAGCCA GAAACACTTC CAATTGTTGG ATTACATATG TTTCCTTTAA 12960 

AAGCAGGGTT AGTTCATAAT ATAAATGCGG TTTATTCTCA ACTTTTGGAA CAAATTACAA 13020 

ATACAACTCA AGTATTCAAT AGATTTCCTA AAAATGCTAT ACTAATGCAA CCACCTTACA 13080 

GCACCGTAAC ATGGATAAGT GAAAATGTCC CCTTTGTTGC AGATCACGGG ATTCAGCCAT 13140 

TAAAAAACAG CCTTACAGGT GTACAAAGAG TTACTATAAC AGACGACAGA AGGAGATCTT 13200 

GTCCATACAT ACAGAAATCT TTGGCGACTG TTGTCCCTAA AGTACTTTCA AGTGCTACAC 13260 

TTCAGTAACA ATCTGGCTGA TATCTCTGGG CCTTATCCTC CTGGAACCGT TATGTCTATT 13320 

TTAGTTAGTC CCTCTGATAA TACCGGGTGG GGTATTGGAA CATCAAGTAT GAGGGCTACT 13380 

GGCTTGAAAT TTTCTAAAAA ACAACCTGTT AGAGTGCGAC CTTATTACAG AGCTCAGTGG 13440 

GGACAGCTTA ATGCTCGTAC TTCACTTGAG AAACTAAAAA CCAAATTGAA ATATTATGAA 13500 

AAATTGTACA GGGACAGACT AAAAAGAAAA ACAGTTGTTC CAAAGAAAAA GAGGTCACCT 13560 

ACATCTCCTG CGGATCGACT TAAAAAATAT CTTAAAGCTG TCAGTCAAAT CAAAGCTTTC 13620 

AATAGAGCTA GAAGAGCAGC CCAATAAATA TTATTTTTCA CTTGCAGATG AAGGTAGTTC 13680 

ACGTGCTTAA ATCTCCTCAT CGTCGAAGAC ATACACGTCG TTACAAAAAA CTAAAAAAAA 13740 

TCAATCTATC TCCATACATT TTACCTAAAG AATTGCAAGG CGGTTTTTTA CCAGCTCTCA 13800 

TTCCTATCAT AGCAGCCGCA ATTAGCGCAG CCCCTGCTAT AGCTGGAACT GTAATAGCTG 13860 

CTAAAAATGC TAATCGTTCT TAAAATTTAG AAAACTTTTT TTTTAACAGA TCACATGGCT 13920 

TTTTCAAGAT TAGCTCCCCA TTGCGGCTTA ACACCTGTTT ATGGCCACAC CGTTGGAATC 13980 

TGTGATATGA GAGGAGGTTT CAGCTGGTCT AGTTTGGGAA ATTCTTTTAC TTCTGGTTTA 14040 

AGAAACATAG GTTCATTTAT ATCAAATACT GCTCAAAAAA TAGGTCAATC ACAAGGATTT 14100 

CAGCAAGCCA AACAAGGTCT ACTGCAATCA AATGTTTTAG AAAATGCAGG ACAATTAGCA 14160 

GGTCAAACTT TAAATACTTT GGTAGATATT GGAAGATTAA AGGTAGAGAA AGATCTAGAA 14220 

AAATTGAAAC AAAAAGTTAT AGGGAACGAC CAACAAATTA CTCAAGAACA ATTAGCTCAA 14280 

CTAATAGCCA GCTTAAAACC AAAAGATGAA ATGTTTGTAA AGCAATCAGA AAAAATTGTT 14340 

GAACCTATGA GACCAGAAAT TAAATCTAGC CAAATGCCTG TAGAAATGTC TTTTTATGAT 14400 

TCTGTAAGTG ATGAACCAAT CATAAAAACC AAAGAAGTTA GCCCTCCTTC ATTTTCATCT 14460 

GAATCTTCAC ATTCATATTC TCACCCAAGA AAAAGAAAAC GCGTATCCGG TTGGGGTGCA 14520 

TTTTTGGATA ACATGACTGG AGATGGAGTA AATTTTAATA CAAGAAGATA TTGTTATTAA 14580 

AAACACTTTT TATTTACAGA TGGAGCCACA GCGTGAATTT TTTCACATTG CGGGTAGAAA 14640 

TGCAAGGGAA TACTTGTCTG AAAATCTGGT ACAATTCATC TCTGCCACTC AAAGTTTTTT 14700 

TAATCTTGGA GAAAAATTTA GAGATCCTTT TGTAGCTCCA TCGACGGGTG TAACTACTGA 14760 

CCGTTCTCAG AAACTTCAAC TTCGTATAGT TCCGATTCAA ACTGAGGACA ATGAAAACTT 14820 

TTACAAAACT AGATTTACTT TAAATGTAGG AGATAACAGA GTTGCAGATC TTGGAAGTGC 14880 

ATATTTTGAC ATTGAAGGAG T TAT T GAT AG AGGACCTACT TTTAAACCTT ATGGAGGGAC 14940 

AGCTTATAAT CCATTAGCCC CAAAATCAGC TTTTCCCAAT GCAGCTTTTA TGGATACTGA 15000 

TGAAGCTACA ACAATTTATA TTGCTCAACT CCCTAATGCT TATAATGCTC AAAACAAAGG 15060 

TGTAGAAGAA GCAATTCGAG TAGAAGCAAA CACTACTACT CCTAATCCTC AATCAGGAGA 15120 

ATATGCTACT TATGACTCTG CCAAATTTAA TCCAGAAACT ACTGGTGCTT CTGGAAGGCT 15180 

TTTAGGAATT AATAGCTTAG GAGATCTTTT TCCGGCTTAT GGATCTTATT GTAGACCTCA 15240 

ATCAGCAGAT GGTAACATTT CAACTGCACC CATAACTAAA GTCTATCTAA ACACTACTGC 15300 

TACAGATGAC AGGGTCAGTG GAGTTACTGC AGTTGACACC GCAACCAGAT TGCATCCAGA 15360 

TGCTCATTAT ATTGAATATA CTGATGAAGC CAAAGCTACA GCTATAGGAA ATCGCCCAAA 15420 

TTATATTGGT TTCCGAGACA ATTTTATTGG ACTCATGTTC TACAATAATG GTTCTAATGC 15480 

AGGAACATTT TCCAGCCAAA CACAACAACT TAATGTTGTT TTAGACTTGA ATGACAGAAA 15540 

CAGTGAACTA AGCTATCAAT ATCTAATAGC AGATCTGACA GATAGGTATA GATATTTTGC 15600 

ACTTTGGAAC CAAGCAGTTG ATAGTTACGA CCAGTATGTC AGAATTTTGC ATAATGAAGG 15660 

ATATGAAGAA GCCCCTCCGG CCTTATCATT TCCTTCTCAA GGTATCCAAA ATTATTTCAT 15720 

GCCTACTGCG GCAGGTAATG CGATGACAGT AGACACGGGT AGAAATACTG CAGCAAAAAC 15780 

AGATAACACC AAGGCTTTTA TAGGATATGG CAACATGCCA TCTTTGGAAA TGAATCTGAC 15840 

AGCAAATCTA CAACGTACAT TTTTGTGGTC TAATGTAGCA ATGTATCTGC CAGATAGGCT 15900 
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Fig 1 (cont) 

GAAAACAACA CCACCCAACA TAAATCTACC TGATGACACC AACTCTTACG GATATATAAA 15960 
TGGAAGGGTC CCTCTAGCAA ACATAATAGA TACATGGACT AACATTGGGG CTAGGTGGTC 16020 
ATTAGATGTT ATGGATACTG TAAATCCATT TAATCACCAC AGAAATTCAG GACTAAAGTA 16080 
TAGGTCACAA CTGTTAGGAA ATGGAAGATA TTGCAGATTT CACATTCAAG TACCTCAAAA 16140 
ATTTTTTCCT ATAAAAAATC TTTTGTTGCT GCCAGGAACA TATAATTATG AATGGTACTT 16200 
TAGAAAGGAT CCCAACATGG TTTTTCAGTC TACTTTAGGT AACGACCTTA GAGCAGATGG 16260 
CGCAACTATT ACATACACCA ACATAAATTT ATATGTTTCA TTTTTCCCTA TGAATTATGA 16320 
AACAGTAAGT GAACTTGAAT TGATGTTGCG TAATGCTACT AATGATCAAA ACTTTGCAGA 16380 
TTATTTGGGT GCGGTAACTA ATCTTTATCA AATCCCAGCT AAT AC AAA T A CTGTAGTAGT 16440 
GAACGTACCA GATAGATCTT GGGGTGCTTT CAGAGGATGG AGTTTCAATA GAATTAAAGC 16500 
TTCAGAAACA CCTATGATAG GAGCAACAAA AGATCCAAAT TTTACTTATT CAGGATCTAT 16560 
ACCGCTACTA GATGGTACTT TCTATTTAAC ACACACTTTT CAACGAGTTT CTATTCAGTG 16620 
GGATTCTAGC GTTCCATGGC CAGGAGATGA TAGGCTTTTG ATTCCAAATT GGTTTGAAAT 16680 
TAAGAGAGAT CCTAATATGG ACGCAGAAGG TTATACTATG AGTCAAAGTA CTATCACAAA 16740 
AGATTTTTAT TTGGTACAAA TGGCTGCTAA TTATAATCAA GCTTATCAAG GTTATAAATT 16800 
GCCAGTACAT TCTAAATATT ATGGATTTTT AGAAAATTTT CAACCTATGA GTCGCCAAGT 16860 
ACCAATTTAT GGTAATGGCA CTTATGATTT ATATACTGCT TATATTACAA ACCAAAGAAC 16920 
CATGCAAATT TGGAATAATA GTGGTTTAGA ATCTAAAACT TCAAATCCTC CTATGTTATC 16980 
CAACACTGGT CATCTTTATG TAGCTAACTG GCCATACCCT TTGATTGGAC CAAATGCTAT 17040 
TGAAAACCAA C AAAC T G AAA GGAAATTTTT GTGTGATAAG TATATGTGGC AGATACCATT 17100 
TTCTAGTAAT TTTTTGAATA TGGGTAATTT AACAGATTTA GGGCAAAGTG TTTTGTACAC 17160 
TAATTCTAGT CATTCACTTA ATATGGTTTT TACTGTGGAT AGTATGCCTG AAACAACTTA 17220 
TCTAATGCTT TTATTTGGTG TTTTCGACCA AGTTGTTATT AATCAACCAA CAAGAAGTGG 17280 
AATAAGTGTA GCTTATTTGC GCCTTCCTTT TTCAGCTGGT AGTGCAGCAA CATGAGCGGC 173 40 
ACATCCGAAA GTGAGCTGAA AAATCTGATT TCATCATTAC ATTTAAATAA TGGATTTTTG 17400 
GGCATTTTTG ATTGCAGATT TCCAGGTTTT CTGCAAAAAT CTAAAATTCA AACTGCTATT 17460 
ATTAATACAG GTCCCAGAGA ACAAGGCGGA ATACACTGGA TAACATTAGC ATTAGAACCC 17520 
ATTTCTTATA AGCTATTTAT ATTTGATCCA CTCGGATGGA AAGACACTCA ATTAATTAAA 17580 
TTTTATAATT TTTCACTAAA TTCTCTTATT AAAAGGTCGG CCTTAAATAA CTCAGACAGA 17640 
TGTATTACAG TAGAAAGAAA TACTCAAAGT GTTCAATGTA CCTGTGCGGG ATCGTGCGGC 17700 
TTGTTTTGTA TATTTTTCTT ATACTGTTTT CACTTTTATA AACAAAATGT ATTTAAAAGT 17760 
TGGCTTTTTC AAAAATTAAA CGGTTCAACC CCTTCTCTGA TCCCATGTGA ACCACATCTA 17820 
TTACATGAAA ACCAGACATT TCTTTATGAT TTTTTAAATG CAAAAAGTGT TTATTTTCGA 17880 
AAAAATTATA GAACATTTAT TGAAAATACT AAGACTGGAT TAATAAAAAC ACATTAATTG 17940 
TATTCTTGCT TTTTGACGTT TTCATTAGTC TTCATCTTCA TCTTCTTCTT CACTGCTAGA 18000 
TTCCAAGATG GTTTTTTTTT TCTTTGATGG AGTAGGCTCT TCAATAGTTC CAAAAGGATT 18060 
CATATCAGAA TCCTCTTCTA TGTTAGGCAA CATAGTATTT TTAACCTGGA ATGACTGATT 18120 
CCACTTAAAT TGAGAAAACT GAATTGGAAT GTTATTTCCC ATACATTCAT TCCAAAATTT 18180 
ACGCACAAGA GTTAAACACT GTAACATATC TGGCAAGCTA ATTTTCATCT CACAAAATTT 18240 
TCCATTATTA CGTCTCAAGT TGTATTGATA GTTACAACAT TGAAACACAA AAACAGCAGG 18300 
GAATGTAACT GCTGCGGCCT GAACTCTATT AACATCCTGA ACATCAATTC CTTCCACTCC 18360 
AGATATAGAA AATGGAGTTA TTTTAGGGAG TTGTTTTCCT ATTGTTTGTT TGCCACCATA 18420 
ATTACATTCA CACTGACCCA ATATAAAAAG CATATTTCCG ACTTTAGCTT TCGGAAACAC 18480 
AGCTTTTGTA GTTTCAATGG CATTTTGCAT AGCCAGCAAG GCCTTCTTTT CATCTGAAAA 18540 
GTTAAGACCA CAACTGCGAG GAGAACATTG CCCAAAACGC TGATGGGCAT CCTCAGCACA 18600 
TAACACGTAA TGTTCCTGAA CTATTTTTAC TACTTGTTTA TTCATACGCC CATTACTAAG 18660 
AACACCCCTC CCTTCCTTTA GGGCTTGCAC CCCTGCTTCC GATGTTGGAG GCATTTCAAT 18720 
TTCATTCACC CTTTTAAACA TGAAGTCACC ATGAAAACAT CTAGGACGGT CCTCCTCCCA 18780 
ATCATGATAC CACAAATAAC AACCAGAAGC ATTAAAGTTT GGAATCAAGT CAATTTGCTT 18840 
ACAAATTGCA CTATATAGCA TTCTACCTCC TACAGTAGCC ATAGATTTAC TGCTACTATA 18900 
AGTCAAATTT ATAATTTTCA TCTTTTTCAT GTACTGAGCA AATAATTTTT CACAATCTCC 18960 
TTCTTCAGGA TGAAACTTCA TTTGACTGGT ATCAACTTTA ACACACTCTC CAAATTTAGC 19020 
TAAAATTTCG AGCGCCGCTT GAACTTTATT CTGAAATTCT TCTGTAGTAG ATTTTCTCTT 19080 
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Fig 1 (cont) 

CTTGATAGAT TTAGTAACTT TTTTAGAAGA CATTATGTTA GTTTTTTTCT CGTTGTAGGA 19140 
TGGCTGAAAA AAATATGGGA GAGTCAGAGA AGGGTTTGAA CGAAGAAGAA TTTAACTCTA 19200 
TTCTATCAAA ACATCTGGAA AGACAAATTA AAATCTGTAA AGCGTTAACA TCAAAATTAT 19260 
CGAACTGGAA TATTGGAACA TTGTTAGAAA ACTTGTTATT TTGTCCTGAT GAAAGACAAT 19320 
CATCAGGTGA TCCCGACCCA AAACTAAACT TTTATCCGCC TTTTTTAATT CCGGAATGTC 19380 
TTGCATTGCA CTATCCATTT TTTCTAACAA CTCCTATTCC GCTATCATGC AAAGCGAACA 19440 
AAATAGGAAC TAACACTTAC CGAAAATGGA TGAACAATCA AGTCCTGGAT TTACAAATAC 19500 
CTTCCTTGGA AAATTGCAAA TGGGATGATA GCTTGGGAAA TGTAGATTTA ATTGAAGAGC 19560 
TTAAAGAGAA CCAAAAACTT GTTTTAGTAA AACAAGACCA TGAAAGAAAT ATATGGTTTA 19620 
AATCAAAATG CAAACAACTT CAAAGTTTCA GCTATCCCTC ACTCAGTCTG CCCCCAGTTT 19680 
TACAACAAGT TTTAATTGAA TCTCTTATCG GCATTAGTCA GGATCCTAAT AACTTTGACA 19740 
AAAATTACGA ACCTGCAATA ACTCTAGAAA AACTACAACA TGTAAACTGT GATCAAGATT 19800 
TAAAACAAGT TCAACAAAAA GTATCTTCAG CCGCTACATA CGGAATACTT TTGAAATGCA 19860 
TTCAGACTTT ATTCAGTGAC AAATTATTCA TTCAAAACTG CCAGGAATCA T TAG ATT AC A 19920 
CCTTTAACCA TGGTTATGTA AAATTACTTC AATTTTTGAC AAATGTCAGT TTAAGCGAAT 19980 
TTGTAACTTT CCATGGTTTA ACACACAGGA ACAGACTCAA TAATCCGCAG CAACATACAC 20040 
AATTGGCAAC CGAAGACAAA ATAGACTATA T CAT A GAT AC AGTGTATTTA TTTTTGGTAT 20100 
TTACGTGGCA GACAGCAATG GATATTTGGA ATCAAACATT AGATGATAAA ACAATAAATA 20160 
TAATTAAAGA GGAATTAAAC CAAAATTTTG AGAAAATTGT CAAAGCTGAA TCAGTTGATG 20220 
AAGTTTCTGA AATTTTAAAG TCTATTATTT TCCCTGAACT CATGCTGCGA GCTTTTTGTT 20280 
CTAATTTACC T GAT TT TATA AATCAGAGTC AGATATCAAA TTTTAGAAAC TTTATCTGCA 20340 
TTAAATCCGG CATACCGCAG TCAATTTGCC CCCTATTACC TTCAGATCTA ATTCCTTTAA 20400 
CTTTCCTAGA AAGTCATCCA ATACTCTGGA GTCATGTAAT GTTACTAAAT CTTGCTTCAT 20460 
TTCTAGTAAA CCAAGGCAAT TATTTGCATG AACCCGAAAA ACCTTTAAAT ATTTCATCAG 20520 
TTTACTGTAA TTGTAATTTA TGCTCTCCGC AAAGAATGCC ATGTTACAAT AGCAGTTTGA 20580 
TGCAAGAAAT ACTAACCATT GATAAATTCG AGTTCACAAA CTCTGATAAA ACAAAACAGC 20640 
TAAAACTGAC CCTCCAAACT TTTGCTAATG CCTATCTTAA CAAATTTAAC TCAGCAGAAT 20700 
TCTACCATGA CCAAGTTTTA TTCTACAAAA ACTGTAAAAG TAAATTTTCT AACCAATTAA 20760 
CAGCTTGTGT AATAAAAGAC GAAAAATTAT TGGCTAAAAT AGCAGAAATT CAAATAACGC 20820 
GGGAAAAAGA ACTCTTAAAA AGAGGAAAAG GAATTTATTT GGATCCAGAA ACAGGAGAAA 20880 
TCTTAAACAA TGGAGAAGCC ATATCATCCT CTGAAAACTT CCAAAGGCAA AGAACTAGCT 20940 
ATGCTCTACC ATCAAATGAA GGAGAGCGAG CTGGATGGGA AGCCGATGAG CGAAGAAGAC 21000 
GAAGGAGAAG TGAGTGAGGA TGAAACAGAG ACAACAATTC CAAAGAAAAT GAAGTTTACA 21060 
AGTAAGTAAG CTCTAAATTT TTTATATTAA AAACTGAATT TTTTTAGACA AAATTATTTT 21120 
AAATTAAATC TTTATAGCTA GCAGTTGATC TTTGTTCGTT TTTCAGAAAA CTCAAGTGTT 21180 
CAGTCATATC AAGTTCACTT GCCTCTGAAA CACGAAATTG CGGAAATTCT AGAAAAAATT 21240 
AGACTAGAAT CTAAAAAATA TCCAGGAAAA GTTTATCAAA TAAGAAATAG AACTCCAGCA 21300 
AGTATTACAA AACGATACCT GTATGAAAGA GATCTGAAGA AACTGTTCCA GTATCTAGAA 21360 
GACGCAAAGA AGCTTTACGC TAAGTACCAA AGCTGAGGCT TTATAGTTTT AAATTTTCCC 21420 
GCCATGGCTC AACCAGTGAC GCCTTACGTC TGGAAATACC AACCAGAAAC AGGATATACT 21480 
GCTGGAGCCC ATCAAAATTA TAACACTGTT ATCAACTGGT TGCATGCCAA TCCACAAATG 21540 
TTTGCCAGAA TTCAACATAT AAACACCGCA CGCAATGTTA TGGACAAATT CCGCTCTGAT 21600 
TTGACCCGAG ATGACATCGC GGTTAACATC AACAACTGGC CTGCAGAGGA TTTAATGCAA 21660 
CCTCCTAATT TTCCTTACAT TCCTGCGACC TCTAAATCCG CTTCAACCAT AAATGACTGG 21720 
TTGGCTACCA CTCAAGGAAT TCAACTCAGT GGAACTAGTG AACTAAACGG GTGGGGATCT 21780 
AACCGCCTGA CTTCCTATCC GGATATTCCA CCCATTTTAA AGTATGAAAG GCCTGGTCAA 21840 
CAACTTCAAG GCCAAGGACT TTTTAAGCAA GAAAATATTC ATTTATTTTA CGAATCTCCG 21900 
CGCCTCCCTC GCTCTGGAGG ATTAACTCCC CAACAATTTG TAAAAGAATT TCCGCCTGTT 21960 
GTTTATAATA ACCCCTTCTC AGAATCTATG AGTGTATTTC CGAAAGAATT TAGTCCTTTG 22020 
TTTAACCCTT CAGAATCTTT GAAAAAAACA TCCAGTCAAA CTTTACAATA TAAATAAAAA 22080 
ACTTCTATTG ATCTTTATAC TTACACTAAA GCATCGCGTT TATTTTCGTC GCCATAAAAA 22140 
TATATCAAAG ACCCGTAATT CTCTAACTTT AAATCATTTT TTGAACTAAT CTTAATCCAT 22200 
TTAAATGTAG GAATTAATAT ATCAGAAACC AGTAACAAGC CAGAATTAAA ATATACTTGT 22260 



WO 96/03508 



PCT/AU95/00453 



8/23 
Fig 1 (cont) 

GTCATTTTTA CAGATGAAGC GAGCACGCTG GGACCCGGTT TATCCCTTTT CTGAAGAGAG 22320 
ACTGGTTCCT CTGCCTCCTT TTATTGAAGC CGGAAAAGGG CTAAAAAGCG AAGGGTTGAT 22380 
CTTATCTTTA AACTTTACTG ATCCTATCAC TATAAATCAA ACCGGTTTCT TAACTGTAAA 22440 
ATTGGGAGAT GGAATATTCA TAAACGGAGA GGGTGGCCTA TCAAGCACTG CTCCAAAAGT 22500 
CAAAGTTCCC CTGACTGTCT CAGATGAAAC ATTGCAACTG CTATTAAGTA ATTCTCTAAC 22560 
AACTGAGTCA GACTCTTTAG CTTTAAAACA ACCGCAACTT CCCCTAAAAA TAAATGATGA 22620 
GGGGAGTTTA GTATTGAACT TAAATACTCC TTTAAATCTA CAAAATGAGA GATTGAGTTT 22680 
AAATGTTTCA AATCCACTAA AGATAGCGGC AGATTCTTTA ACTATAAACT TAAAGGAACC 22740 
CCTAGGATTG CAAAATGAAA GTTTGGGCTT AAATCTAAGT GATCCTATGA ATATAACTCC 22800 
AGAAGGAAAT TTAGGTATTA AATTGAAAAA TCCTATGAAA GTTGAAGAAA GTTCTTTAGC 22860 
CTTAAACTAT AAGAATCCTC TCGCCATTAG TAATGATGCG TTAAGTATAA ACATTGCGAA 22920 
TCCATTAACT GTTAATACAA GCGGATCTCT AGGAATATCT TATTCTACTC CCTTACGAAT 22980 
TTCAAATAAT GCTTTATCAT TATTTATAGG AAAACCTTTA GGATTAGGAA CTGACGGCTC 23040 
TTTAACTGTA AATTTAACTA GGCCTCTGGT ATGTCGTCAG AACACTTTGG CCATAAACTA 23100 
CTCAGCCCCA CTAGTGTCAT TGCAAGACAA TCTTACTTTA AGTTATGCTC AACCATTAAC 23160 
TGTAAGCGAT AATTCTTTAA GATTGTCTCT AAATTCTCCA CTAAACACAA ATAGTGATGG 23220 
AAAACTTAGT GTAAACTATT CTAATCCTTT AGTTGTGACT GACTCTAATC TTACCCTCAG 23280 
TGTTAAAAAA CCTGTAATGA TTAACAACAC AGGTAATGTT GACTTAAGCT TTACAGCTCC 233 40 
CATAAAATTA AATGATGCAG AACAGTTGAC TTTAGAAACC ACTGAGCCCT TGGAAGTGGC 23400 
CGATAACGCT CTAAAACTGA AACTTGGAAA AGGCTTAACT GTTAGTAATA ATGCTTTAAC 23460 
CTTAAACCTT GGAAACGGTT TGACTTTCCA ACAAGGTCTT TTACAAATTA AAACTAATAG 23520 
CTCTCTAGGG TTTAATGCTT CTGGGGAATT ATCAACAGCT ACAAAGCAGG GAACCATAAC 23580 
CGTTAACTTT CTAAGCACAA CTCCTATAGC TTTTGGGTGG CAAATAATAC CTACTACTGT 23 640 
AGCTTTCATT TATATTTTAT CAGGAACACA ATTTACTCCT CAATCCCCAG TAACTTCTTT 23700 
AGGTTTTCAA CCCCCACAAG ACTTTTTGGA TTTCTTCGTT TTAAGTCCGT TTGTTACATC 23760 
TGTAACTCAA ATTGTGGGAA ATGATGTTAA GGTTATTGGC CTAACTATTT CTAAAAACCA 23820 
ATCTACCATA ACTATGAAAT TTACTTCTCC CTTAGCTGAA AATGTACCAG TTAGTATGTT 23880 
TACAGCACAT CAATTCAGAC AATGAATATT TTAAAAATTC TTTATTAAAG AGTAATCTTT 23940 
TTACATACCG TTCTTGACAT AATGTGCCTC TATAATTAAC AAATCTAAGC AAGCAAGGTT 24000 
GATCATTGGA ATCTATAGAA GCATAACTCT TCCAATAAGC ATAATCATAT GGCGGTAAAT 24060 
GAAAACCCCT TAAATCTACC ATATTCATCT TTAAGTGTAC AGTATCTAAC AGGTTTTTAC 24120 
AATCTTGCAC TTCTGGACTT TTAAAAACAA ACAGTACTTT CATAGGACAA CAATTGTAAC 24180 
GGTTATAATC TGTTACAATT TTACTTATTT CTTCTTCCAA TGGCAAAGCA TTCCAAAGTC 24240 
TTGTTATAAG TACTGTAAAA TCATCAAATG AATAACATAA CACATTTGTA CAACAATTGG 243 00 
TCCAAGGTAA AAAAACAGGC ACACGAACAT GAACTTTTTT TAAAATTAAC ATCAGTGTCT 24360 
GTTTTAAACT TTGACATTGC AAAGAATTTG GCTGCAAGCA ATGACAATGA AATTGATTTT 24420 
GCTGACAAGG TAAGTCACAC AAATACAACT TTAACAGCCT AAATATAACA ACATTAATGT 24480 
AACTTTCCAA GACTTTAAAA CTAACAAACG GTATATCACA ATAAAAAAGA TGATGAATCC 24540 
CTTCGCAACA CATAATGGAG TTCATGCTAC ATCCAAAGAT GGTTCCGACA AACCTCTGTA 24600 
AATTAAAGAA CAACAATACA ACATACGAAG AAAATTAAAA CGTTTTTCAA AACGAGATAT 24660 
ACATTGCTGC AAAGTATCTG AACATTTACA TTTTATACTT ATAAGCTCAC AAGTTTCAGA 24720 
AAATGTAATT CGTTTAACAG TTTGATATGA ATACCATTTT GAAGAAAAAT AGAAATAGTT 24780 
TTGTGCATTT GTAAGCTCCC AGAAACATTA ACGGACAGGC AAATCCAAGT ATTACAACAA 24840 
ACAGGAACAG TCTTAACGTT TCGTTCAGAA AACAAAGTAA CAGGCATATG ATTAAAGCAA 24900 
GACAATAAAA CACTTTTGGC AGCTAAACAT TGCAAAGATC CAGGTGAATT ACAATGACAA 24960 
TGATAATAAA ACTTATAAGC CATATCGGCC CTCTTGCAAA ACGAATCAGC TTTTTGGCTT 25020 
ATAGGAAAAT AACAAAAAAA CTGATTATAT ATGAATGGAG TTAATATCTT CTTCAAATTA 25080 
TACACACGAA TAGCAGAACC AAGACGACCA CGCCCAACAC AGGTAAATAT TTCAAGTCCA 25140 
TGACTAGGAA CAGATGGTTT CTCACAAGCA ACAACTTTGA TTTGCTTATC CATCACTGCC 25200 
AATCAGGCTT AATAGGAAAA GAAGAAAAAT AATTTTCCCA ATAATAACGA AAGAAATTCC 25260 
ACGTTTCATC CTGTACATTA CTAGTCACAA ATACAACCTC CGCTATCAAA GATTCCCTAT 25320 
CATTTAAAAC TCCCACCAAA TTGTCCCAGT CTACCTCAAA AAAGCCAGTT CCCATATTTT 25380 
CAAAATTTGC CCATTTTAAA TAATCCAAAG CATCAAATTC AGGAAACAAA TCTTTCTGAG 25440 
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Fig 1 (cont) 

CTAAAACATA TACAGTTTTA TCGCCATTAA ATCTAAAAGC CATCCTAAAT GGACCTCTAG 25500 
CCCAGTAGTT TAAGTACCGG GAAGAGACTA TACAATATAC TTGATATTGA TGTCTGTTAA 25560 
GTGGTGATAA AAAAGAAAGT AATTCAGAAT TAGGATAAAG CATTCTCCCA TGTTGATTCA 25620 
TCTACAAAAA ACAAAAAAAT TATAAGGTTC ATAGAAAACC TACTATTTAA CAAATCTATA 25680 
AAAATGCATT AAAAAGTTAC CTTGAATATA AATTCAGATC ACCTAAAAAA CGAAAAAAAA 25740 
TAACATTTAT GTTAGTAAAT GATAGTCTTT AAAAATTAGA AAAGAATCAA GTCGCTTTTA 25800 
TACTTACAAA CTCCAAATAA ATTCTGTAAC CAAGAGAAAA ATTGTAACCT AAAAGGTAAA 25860 
GAAGAACATT ATAAGATTAA AACCACTCTA AAATCTGAAA AGCATTATGA AAAATTCTGA 25920 
TAGCTGCAAC TTACTAGTCT TCTCCAAATG TTGCAGGCAT TTCAAAAAAT CAAGAGGAAA 25980 
ACCGGAGTTT ATAAAGTAGT AGTCTGATTA TATCTGAAAA AGTTTAACTT CCTTTTCAAC 26040 
CCAACCCAGT CCAATAAAAT TCCAACCTTA ACTTCTTTCC TGCTAAAACT CCATAAAAGT 26100 
CCAATTACCA CTTGACTTTT ATTTAACCTC AATTATGTTA CATGTTATTC TACCCATAAA 26160 
AACTTGATGA CCAAGAACTG ACCTTTCCCA TGTTTTTCTG AAATAACAAA AATGTTGATT 26220 
TAAAGATTTT TAACTACCCA AAAAACCCGC TCTCATGATT TTTTCTTATA TAAACAGGAT 26280 
ACAAAAGAAC TGGCAAAGAT ATTCCATCAT ACTTCTCCAA CTGTCAAAAC ATACCACTTA 26340 
ACCTCTCCCA TGTTTTTTCC CTTTTGCACA AACAGGATAT AAAAAATATT TTTGCCACAA 26400 
TGTTTTTCCT TTTACTCAAC TGCCAGAATA AAAATGAACA GCTTAACCTT TTTCCCTCTT 26460 
AACCCATTGC GTTCCTCTAA GAAAAAAATT ATCCCGCCCA ATATGCTAAA GGCTTCTCCC 26520 
GCCAAAACAG CTCAACTTAA AATCTCTCAT GAATAAAACC CAGAGAAAAT TTCCAGTAAT 26580 
AAAAATTAAT AACCGTGAAG TACTAGATCT AATAATGATA TTTTGAACTC ATAAAAATCC 26640 
ACCATCCATG TAATGTTACA AACACTTTTT TATTGAGTTT TTTCTTACAA CTGCATTACA 26700 
TACAGGCCAA GCATCAAACT TTCTTCTGTA TTTCTTCCTA GACCACAAAA TTACAGACTT 26760 
ATATTTCTGC CACAAATCTC TATGATCTTT ACAGTAACAC TTACATTTAA ATGGGGAATA 26820 
CAGCAGCAAA TAAGGATGAG TTAAACATGC GATACAATGA CCAGAAGGAA GATAATACAA 26880 
TACATCACAC CAAAATGAAG GTACAGACAA CATCGCATGA AATCTTAAAT GTGATTTTAC 26940 
AATAAATTTC TGCAGCAGCT TACAATCTAT ATTAGCAAAC CGTTTTATAT ACAAACATAA 27000 
AAACTTGGAA CTTTTCACCA ACTCAATCAT GTTATTATAA CACATTACAA ATTTTGCTAT 27060 
ATCTTTATTT GTCAAATAAC AAAATATCTC AATCCACAGC TCATCTGGCA GCAAACTTCG 27120 
CAAATCCATG ACCTGTAAAA GATACAACAG AAAACAGAAA ATTAATGCCA TTCAATAACA 27180 
TAAAAAATAC AGTCAAATCA CATACTTTTT CTCACTTACA AAACTTTGTG AGCAGGCCTC 27240 
CAAAACAAAC TTCAGAAAAT GGATGCATAC AAGAACATTC TCCTCTCAAA AATTGCTTTA 27300 
ACTGAATGCG GCATTTTGCA CCTCCAGAAA AATGCAGTCC ATTGAGAGGC TCTTCTCTTA 273 60 
AAACACAGAA ATGCTTCTGC AAAATCTGTA AAGAAACTAA CAACTTCCAA ATTCCAATCA 27420 
TCATGCATTG CAAAGAAGGA CATTCAACAG CAAAAGGATC GTGATGAGCC AATAAAGCTT 2748 0 
TACTGTATGA CTCATTTTCA TGAATTACAG TCTGTAACTT ACTATAATGC ATTTTAAGCT 27540 
CTGCTTCACA AATTAATAAT GCTAATTTCT TTAAGCAGCT CAAAGAAAAC TCATCAGGAC 27600 
AACGGCATTT AAGAAAGCAA CAAAATGATT TCTTAAAATA CATTTTTCCA GCATGATGAA 27660 
CAATAAAAAA TTTCAACGTT AAACAATGCA AAAATGCATT TTTATGCACA GTGAAAGTAA 27720 
TTTTTTCAGC TGAAGCTAAA TCACAGCCTA TTTTATTACA TGATTTTGTA TGCTCCAAAA 27780 
GAGCTTGTTT TAATTGCTTC AAATCCATCT TCTTACAATT TTTTCTTTTT ATAAACACCA 27840 
GAACCGCATT CAGGCCAATT CCAGTTATTG TTTAAATTTG CTACAGAAAC TGCAGACCAC 27900 
AAAACCACAT CCTCTAAATC AACCCACAAA GATCTATGAT CCACACAAAA ACACAAAGAA 27960 
TGATACGGAG AATACAACAA TAAATGGGGA TTAACAAGGG ACGCAACACA ATGACCCGAA 28020 
GGTAATAAAG TTTTACAGCA CCAATTACAA GCAACAGGTA AT G GAG TATA TTTCCCAATG 28080 
CGACGAGAAA GCCGAATGTC ATTCAGAACA GCATTGCATT TTATCTTCTC AAACCTCTTA 28140 
AGGTGCAATT GTATAAAATA AGAATCCTTA ATGACAGTGA TGAATTGAGG AAAAGCAAAA 28200 
ACAAAACTAG CAATGTCTTT GCTTGTAAGT TTCAAAAATA TCTTCATCCA AATCTCAGTC 28260 
GGTAATTCAA CAAAAAATTC AGGGGCCTAC AAAATTAATC AGACTAATTT AATATCATCT 28320 
TGTAAACAGC GAAAAGAAAA AATAACACAC CCAAAAATAA AAAACTCTTA CCCCTGTTAT 28380 
CCATCGAGAT ACACAGAAAA ATTCAGAACA CTCAGTGTCA TGTTTCTTAA ATTGTTCCCA 28440 
AAGCTCAGAC ATTCTAAGCC AAAAATTTTT TGAGAACTGC AAAAACCCAG TTTTTATAAC 28500 
AAAGCCTTAA TGTTTTCTTA ACTGATTTAA CTGCCCTAAC AGGAACTCCA CATTCCGGCC 28560 
ACCGCCACCC AGGGGACAAA TCTTGCCAAG AACTACAAGT CCATAAAACA ACATCCTGCA 28620 
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Fig 1 (cont) 
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